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PREFACE 


An American student approaching the higher parts of mathe- 
matics usually finds himself unfamiliar with most of the main facts 
of algebra, to say nothing of their proofs. Thus he has only a 
rudimentary knowledge of systems of linear equations, and he knows 
next to nothing about the subject of quadratic forms. Students in 
this condition, if they receive any algebraic instruction at all, are 
usually plunged into the detailed study of some special branch of 
algebra, such as the theory of equations or the theory of invariants, 
where their lack of real mastery of algebraic principles makes it 
almost inevitable that the work done should degenerate to the level 
of purely formal manipulations. It is the object of the present 
book to introduce the student to higher algebra in such a way that 
he shall, on the one hand, learn what is meant by a proof in algebra 
and acquaint himself with the proofs of the most fundamental facts, 
and, on the other, become familiar with many important results of 
algebra which are new to him. 

The book being thus intended, not as a compendium, but really, 
as its title states, only as an introduction to higher algebra, the 
attempt has been made throughout to lay a sufficiently broad founda- 
tion to enable the reader to pursue his further studies intelligently, 
rather than to carry any single topic to logical completeness. No 
apology seems necessary for the omission of even such important 
subjects as Galois’s Theory and a systematic treatment of invariants. 
A selection being necessary, those subjects have been chosen for 
treatment which have proved themselves of greatest importance in 
geometry and analysis, as well as in algebra, and the relations of 
the algebraic theories to geometry have been emphasized throughout. 
At the same time it must be borne in mind that the subject primarily 
treated is algebra, not analytic geometry, so that such geometric 
information as is given is necessarily of a fragmentary and some- 
what accidental character. 

No algebraic knowledge is presupposed beyond a familiarity with 


elementary algebra up to and including quadratic equations, and 
W 
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such a knowledge of determinants and the method of mathematical 
induction as may easily be acquired by a freshman in a week or 
two. Nevertheless, the book is not intended for wholly immature 
readers, but rather for students who have had two or three years’ 
training in the elements of higher mathematics, particularly in 
analytic geometry and the calculus. In fact, a good elementary 
knowledge of analytic geometry is indispensable. 

The exercises at the ends of the sections form an essential part 
of the book, not merely in giving the reader an opportunity to think 
for himself on the subjects treated, but also, in many cases, by sup- 
plying him with at least the outlines of important additional theories. 
As illustrations of this we may mention Sylvester’s Law of Nullity 
(page 80), orthogonal transformations (page 154 and page 173), and 
the theory of the invariants of the biquadratic binary form (page 260). 

On a first reading of Chapters I-VII, it may be found desirable 
to omit some or all of sections 10, 11, 18, 19, 20, 25, 27, 34,35. The 
reader may then either take up the subject of quadratic forms 
(Chapters VIII-XII]), or, if he prefer, he may pass directly to the 
more general questions treated in Chapters XI V—XIX. 

The chapters on Elementary Divisors (XX—X XII) form decid- 
edly the most advanced and special portion of the book. A person 
wishing to read them without reading the rest of the book should 
first acquaint himself with the contents of sections 19 (omitting 
Theorem 1), 21-25, 86, 42, 43. 

In a work of this kind, it has not seemed advisable to give many 
bibliographical references, nor would an acknowledgement at this 
point of the sources from which the material has been taken be 
feasible. The work of two mathematicians, however, Kronecker 
and Frobenius, has been of such decisive influence on the character 
of the book that it is fitting that their names receive special men- 
tion here. The author would also acknowledge his indebtedness 
to his colleague, Professor Osgood, for suggestions and criticisms 
relating to Chapters XIV—XVI. 

This book has grown out of courses of lectures which have been 
delivered by the author at Harvard University during the last ten 
years. His thanks are due to Mr. Duval, one of his former pupils, 
without whose assistance the book would probably never have been 
written. 
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INTRODUCTION TO HIGHER ALGEBRA 


CHAPTER I 


POLYNOMIALS AND THEIR MOST FUNDAMENTAL 
PROPERTIES 


1. Polynomials in One Variable. By an integral rational func- 
tion of x, or, as we shall say for brevity, a polynomial in z, is meant 
a function of 2 determined by an expression of the form 


(1) Cyr + Cy 0% + +e + 6, U%, 


where the «’s are integers positive or zero, while the e’s are any con- 
stants, real or imaginary. We may without loss of generality 
assume that no two of the a’s are equal. This being the case, the 
expressions ¢% are called the terms of the polynomial, ¢; is called 
the coefficient of this term, and «; is called its degree. The highest 
degree of any term whose coefficient is not zero is called the degree 
of the polynomial. 

It should be noticed that the conceptions just defined — terms, 
coefficients, degree —apply not to the polynomial itself, but to the 
particular expression (1) which we use to determine the polynomial, 
and it would be quite conceivable that one and the same function of 
x might be given by either one of two wholly different expressions 
of the form (1). We shall presently see (cf. Theorem 5 below) 
that this cannot be the case except for the obvious fact that we 
may insert in or remove from (1) any terms we please with zero 
coefficients. 

By arranging the terms in (1) in the order of decreasing @’s and 
supplying, if necessary, certain missing terms with zero coefficients, 
we may write the polynomial in the normal form 


(2) Aye” + a0" + + + Ay 12+ Ay, 
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It should, however, constantly be borne in mind that a polynomial 
in this form is not necessarily of the nth degree; but will be of the 
nth degree when and only when a, #0. 


DEFINITION. Two polynomials, f,(x) and f,(x), are said to be 
identically equal (f,=f,) of they are equal for all values of vz. A 
polynomial f(x) is said to vanish identically (f=0) of tt vanishes for 
all values of x. 


We learn in elementary algebra how to add, subtract, and multi- 
ply * polynomials; that is, when two polynomials f,(z) and f,(z) are 
given, to form new polynomials equal to the sum, difference, and 
product of these two. 


THEOREM 1. Jf the polynomial 
F(@) = aya" + ajar t+ +a, 
vanishes when x = a, there exists another polynomial 
(£2) = aya" + aja” * 4+ .-- + al _,, 

such that J(«) =(«@— «)$, (2). 

For since by hypothesis f(a) = 0, we have 

F(@)=f(2) —fla)= Ay (x" — a”) + a, (a1 — a’) + +++ + @,_3(@ — @). 

Now by the rule of elementary algebra for multiplying together 

two polynomials we have 


a — oF =(e— a) (a + ook? oo. a). 
Hence 


F(@) = (@ = o) [ag (a! + aa? 4 4 a?) + a, (a"* + aa 4 
ote a?) =e eral 


If we take as $,(x) the polynomial in brackets, our theorem is 
proved. 


Suppose now that 8 is another value of x distinct from « for which 
F(©) is zero. Then . 
S(B) = (8B — «) p(B) = 05 


* The question of division is somewhat more complicated and will be considered 
in § 63. 
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and since B—«+#0, $,(8)=0. We can therefore apply the theo- 
rem just proved to the polynomial ¢,(«), thus getting a new 
polynomial by (x) = ty2"? + afar 4 Lally 


such that $;(x)=(«#— B)¢,(2), 
and therefore J (x) = (a— a)(x — B)¢,(z). 


Proceeding in this way, we get the following general result: 


THEOREM 2. If a, a, +++ a, are k distinct constants, and if 


F(@) = aya" +a a" 1+ «+a, (n=k), 
and Flu) =F) = + = Flax) = 0, 
then F(a) = (@ — ay) (@ — ay) +++ (@ — &) (2), 
where b(2) = ap arg’ Heb, a Fos 4 8, y 


Applying this theorem in particular to the case n = k, we see that 
if the polynomial f(x) vanishes for n distinct values 0, a, +++ @, of 2, 


then F(a) = a)(" — 0) (2 — ay) + (x — On)» 


Accordingly, if a, #0, there can be no value of w other than Oy? On 
for which f(x)=0. We have thus proved 


THEOREM 3. A polynomial of the nth degree in x cannot vanish 
Sor more than n distinct values of x. 


Since the only polynomials which have no degree are those all of 
whose coefficients are zero, and since such polynomials obviously 
vanish identically, we get the fundamental result : 


THEOREM 4. A necessary and sufficient condition that a polyno- 
mial in x vanish identically is that all its coefficients be zero. 


Since two polynomials in 2 are identically equal when and only 
when their difference vanishes identically, we have 


TuroreM 5. A necessary and sufficient condition that two polyno- 
mials in x be identically equal is that they have the same coefficients. 

This theorem shows, as was said above, that the terms, coefficients, 
and degree cf a polynomial depend merely on the polynomial itself, 
not on the special way in which it is expressed. 
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2. Polynomials in More than One Variable. A function of (x, y) 
is called a polynomial if it is given by an expression of the form 


cyany?s + Cy grnyhe fone + Cy U7HY Phy 


where the a’s and #’s are integers positive or zero. 
More generally, a function of (271, 2, --- %,) is called a polynomial 
if it is determined by an expression of the form 


(1) 6, 0,20, vee OME Ce he i ie eS CL" oPh o28 Lng 


where the a’s, f’s, -:- v’s are integers positive or zero. 
Here we may assume without loss of generality that in no two 
terms are the exponents of the various 2’s the same ; that is, that if 


a; = A, B= Bi, sh ee 


This assumption being made, ¢;7,%x,%--- x,” is called a term of the 
polynomial, ¢; its coefficient, a; the degree of the term in 2x, B; in Xp, 
etc., and a,+6;+-:-+4; the total degree, or simply the degree, of 
the term. The highest degree in 2, of any term in the polynomial 
whose coefficient is not zero is called the degree of the polynomial 
in x; and the highest total degree of any term whose coefficient is 
not zero is called the degree of the polynomial. 

Here, as in § 1, the conceptions just defined apply for the present 
not to the function itself but to the special method of representing 
it by an expression of the form (1). We shall see presently, how- 
ever, that this method is unique. 

Before going farther, we note explicitly that according to the 
definition we have given, a polynomial all of whose coefficients are zero 
has no degree. 

When we speak of a: polynomial in n variables, we do not nee- 
essarily mean that all n variables are actually present. One or more 
of them may have the exponent zero in every term, and hence not 
appear at all. Thus a polynomial in one variable, or even a con- 
stant, may.be regarded as a special case of a polynomial in any 
larger number of variables. 

A polynomial all of whose terms are of the same degree is said 
to be homogeneous. Such polynomials we will speak of as forms,* 

* There is diversity of usage here. Some writers, following Kronecker, apply the 


term form to all polynomials. On the other hand, homogeneous polynomials are often 
spoken of as guantics by English writers. 
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distinguishing between binary, ternary, quaternary, and in general, 
n-ary forms according to the number of variables involved, binary 
forms involving two, ternary three, ete. 

Another method of classifying forms is according to their 
degree. We speak here of linear forms, quadratic forms, cubic 
forms, etc., according as the degree is 1, 2, 8, etc. We will, how- 
ever, agree that a polynomial all of whose coefficients are zero may 
also be spoken of indifferently as a linear form, quadratic form, 
cubic form, ete., in spite of the fact that it has no degree. 

Tf all the coefficients of a polynomial are real, it is called a real 
polynomial even though, in the course of our work, we attribute 
imaginary values to the variables. 

It is frequently convenient to have a polynomial in more than one 
variable arranged according to the descending powers of some one 
of the variables. Thus a normal form in which we may write a 
polynomial in m variables is 


Poa - ©) ay" + pyar ae Vn) ay" tes Pink Xap at Lp)s 
the ¢’s being polynomials in the n — 1 variables (,, --- x,). 
We learn in elementary algebra how to add, subtract, and multiply 
polynomials, getting as the result new polynomials. 


DEFINITION. Two polynomials in any number of variables are 
said to be identically equal if they are equal for all values of the vari- 
ables. A polynomial is said to vanish identically if it vanishes for 
all values of the variables. 


THEOREM 1. A necessary and sufficient condition that a polyno- 
mial in any number of variables vanish identically is that all cts coeffi- 
cients be zero. 

That this is a sufficient condition is at once obvious. To prove 
that it is a necessary condition we use the method of mathematical 
induction. Since we know that the theorem is true in the case of 
one variable (Theorem 4, § 1), the theorem will be completely proved 
if we can show that if it is true for a certain number n — 1 of vari- 
ables, it is true for n variables. 

Suppose, then, that 


Fey + 5) = bo ty “2 AEA + Gal ty °* By) m+ + by (Bay By) 


vanishes identically. If we assign to (a, --- 2,) any fixed values 
(x), --- a), f becomes a polynomial in 2 alone, which, by hypothesis, 
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vanishes for all values of x,. Hence its coefficients must, by Theorem 
4, § 1, all be zero: 


di (th, °° &,) = 0 (ces Ope” Uy 
That is, the polynomials ¢), 4; °°: @m vanish for all values of the 
variables, since (2, +++ 2/,) was any set of values. Accordingly, by 


the assumption we have made that our theorem is true for polyno- 
mials in m — 1 variables, all the coefficients of all the polynomials 
dy by ** Gm are zero. These, however, are simply the coefficients 
of f. Thus our theorem is proved. 

Since two polynomials are identically equal when and only 
when their difference is identically zero, we infer now at once the 
further theorem : 


THEOREM 2. <A necessary and sufficient condition that two poly- 
nomials be identically equal is that the coefficients of their corresponding 
terms be equal. 


We come next to 


THEOREM 3. If f, and f, are polynomials in any number of vari- 
ables of degrees m, and mz respectively, the product f, fz will be of de- 
gree m+ M2. 


This theorem is obviously true in the case of polynomials in one 
variable. If, then, assuming it to be true for polynomials in n — 1 
variables we can prove it to be true for polynomials in n variables, 
the proof of our theorem by the method of mathematical induction 
will be complete. 

Let us look first at the special case in which both polynomials 
are homogeneous. Here every term we get by multiplying them 
together by the method of elementary algebra is of degree m, + my. 
Our theorem will therefore be proved if we can show that there is at 
least one term in the product whose coefficient is not zero. For 
this purpose, let us arrange the two polynomials f, and f, according 
to descending powers of z,, 


Si(@y +++ Ly) = Py (Lay +++ Ly) Uy + Pl (Loy +++ Vy) OTF +, 
So(@y L_) = fy! (X2y HOH Dn)" Sr py! (a, ahr ae ea vee, 


Here we may assume that neither ¢) nor $j vanishes identically. 
Since f, and f, are homogeneous, ¢j and ¢j! will also be homogeneous 
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of degrees m,—, and m,—k, respectively. In the product f, fy the 
terms of highest degree in x, will be those in the product 


$4 (22, eae Bn) fy (xe, stan Xa, 


and since we assume our theorem to hold for polynomials in n— 1. 
variables, $)¢j will be a polynomial of degree m,+m,—k, — ky. 
Any term in this product whose coefficient is not zero gives us when 
multiplied by 2,"*% a term of the product f, f, of degree m,+m, 
whose coefficient is not zero. Thus our theorem is proved for the 
case of homogeneous polynomials. 

Let us now, in the general case, write f, and f, in the forms 


A(t» hes a Pmt ae Ln) + Piano see Lr) + tety 
SFo(@s is Ln) = a Con ee Ln) =f P1245 >, Ln) 2 ae?) 


where ¢; and ¢j' are homogeneous polynomials which are either of 
degrees 7 and 7 respectively, or which vanish identically. Since, 
by hypothesis, f, and f, are of. degrees m, and m, respectively, 
g),, and ¢/ will not vanish pepebiee but will be of degrees 
m, and my. 

The terms of highest degree in the product f, f, will therefore be 
the terms of the product ¢j,, ¢/!,, and this being a product of homo- 
geneous polynomials comes under the case just treated and is there- 
fore of degree m,+m,. The same is therefore true of the product 
Fi fy and our theorem is proved. 

By a successive application of this ficorent we infer 


CoroLLaRy. Jf k polynomials are of degrees my, Mg +++ mM, re- 
spectively, their product is of degree m+ Mm, + ++: + mg. 


We mention further, on account of their great importance, the 
two rather obvious results : 


THEOREM 4. [Tf the product of two or more polynomials is identi- 
cally zero, at least one of the factors must be rdentically zero. 


For if none of them were identically zero, they would all have 
definite degrees, and therefore their product would, by Theorem 8, 
have a definite degree, and would therefore not vanish identically. 

It is from this theorem that we draw our justification for cancel- 
ling out from an identity a factor which we know to be not identi- 
cally zero. 
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TuEorEM 5. If f(x, +++ %,) ts @ polynomial which is not identically 
zero, and if (a, «++ LZ) vanishes at all points where f does not vanish, 
then @ vanishes identically. 

This follows from Theorem 4 when we notice that fd =0. 


EXERCISES 
1. If f and ¢ are polynomials in any number of variables, what can be inferred 
from the identity £2=¢? concerning the relation between the polynomials f and $? 
2. If f,and fy are polynomials in (2, -+- %,) which are of degrees m, and my, 
respectively in «1, prove that their product is of degree m, + mg in 2}. 


3. Geometric Interpretations. In dealing with functions of a 
single real variable, the different values which the variable may 
take on may be represented geometrically by the points of a line ; 
it being understood that when we speak of a point x we mean the 
point which is situated on the line at a distance of 2 units (to the 
right or left according as 2 is positive or negative) from a certain 
fixed origin 0, on the line. Similarly, in the case of functions of — 
two real variables, the sets of values of the variables may be pictured 
geometrically by the points of a plane, and in the case of three real 
variables, by the points of space; the set of values represented by a 
point being, in each case, the rectangular codrdinates of that point. 
When we come to functions of four or more variables, however, this 
geometric representation is impossible. 

The complex variable 2 = & + ni depends on the two independent 
real variables € and 7 in such a way that to every pair of real values 
(&, 7) there corresponds one and only one value of x. The different 
values which a single complex variable may take on may, therefore, 
be represented by the points of a plane in which (£, 7) are used as 
cartesian codrdinates. In dealing with functions of more than one 
complex variable, however, this geometric representation is impos- 
sible, since even two complex variables x=&+ ni, y= E+ nt are 
equivalent to four real variables (&, , &, 7;)- 

By the neighborhood of a point x= a we mean that part of the 
line between the points x=a—«@ and a= a+ « (« being an arbitrary 
positive constant, large or small), or what is the same thing, all 
points whose codrdinates a satisfy the inequality |~—a|<«.* 


* We use the symbol |Z | to denote the absolute value of Z, i.e. the numerical 
value of Z if Z is real, the modulus of Z if Z is imaginary. 
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Similarly, by the neighborhood of a point (a, 6) in a plane, we 
shall mean all points whose codrdinates (x, y) satisfy the inequalities 


|r—al<a, ly—|<B, 


where « and 8 are positive constants. This neighborhood thus con- 
sists of the interior of a rectangle of which (a, 6) is the center and 
whose sides are parallel to the coordinate axes. 

By the neighborhood of a point (a, 6, ¢) in space we mean all 
points whose coérdinates (2, y, z) satisfy the inequalities 


ju—al<a, |y—b|<B, |z—el<y. 


In all these cases it will be noticed that the neighborhood 
may be large or small according to the choice of the constants 
a, By ¥- 

If we are dealing with a single complex variable x= &+ ni, we 
understand by the neighborhood of a point a all points in the plane 
of complex quantities whose complex codrdinate x satisfies the in- 
equality |z—a|<«, « being as before a real positive constant. Since 
|x — a| is equal to the distance between z and a, the neighborhood of 
a now consists of the interior of a circle of radius @ described about 
@ as center. 

It is found convenient to extend the geometric terminology 
we have here introduced to the case of any number of real or 
complex variables. Thus if we are dealing with n independent 
variables (x, %, --- %), we speak of any particular set of values 
of these variables as a point in space of n dimensions. Here 
we have to distinguish between real points, that is sets of values 
of the zs which are all real, and imaginary points in which 
this is not the case. In using these terms we do not propose 
even to raise the question whether in any geometric sense there 
is such a thing as space of more than three dimensions. We 
merely use these terms in a wholly conventional algebraic 
sense because on the one hand they have the advantage of 
conciseness over the ordinary algebraic terms, and on the other 
hand, by calling up in our minds the geometric pictures of three 
dimensions or less, this terminology is often suggestive of new 
relations which might otherwise not present themselves to us so 
readily. 
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By the neighborhood of the point (a,, dg, +++ dn) we understand all 
points which satisfy the inequalities 


|ay— a, |<, [a — ay|<dtg ** |L_ — An |< Ons 


where @,, G5, +++ %, are real positive constants. 

If, in particular, (a,, a, -** d,) is a real point, we may speak of 
the real neighborhood of this point, meaning thereby all real pas 
(24, Ys *** %) Which satisfy the above inequalities. 

As an illustration of the use to which the conception of the 
neighborhood of a point can be put in algebra, we will prove the 
following important theorem : 


THEOREM 1. A necessary and sufficient condition that a poly- 
nomial f(x, +++ %,) vanish identically is that it vanish throughout the 
neighborhood of a point (ay, +++ dy 


That this is a necessary condition is obvious. To prove that it 
is sufficient we begin with the case n=1. 

Suppose then that f(z) vanishes throughout a certain neighbor- 
hood of the point z=a. If f(x) did not vanish identically, it would 
be of some definite degree, say #, and therefore could not vanish at 
more than & points (cf. Theorem 8,§ 1). This, however, is not the 
case, since it vanishes at an infinite number of points, namely all 
points in the neighborhood of z=a. Thus our theorem is proved 
in the case n= 1. 

Turning now to the case n = 2, let 


Fs Y) = Goly)a* + by(y)ar t+ + $.(y) 


be a polynomial which vanishes throughout a certain neighborhood 
of the point (a, 6), say when 


lz—al<a, |y—b\/<B. 
Let y be any constant satisfying the inequality 
lYo— | <B. 


Then f(x, yy) is a polynominal in z alone which vanishes whenever 
|v—al<a, Hence, by the case »=1 of our theorem, f(z, y,)= 0. 


That is, 
bo(Yo) = $1 (Yo) = ++ = Pi(Yo) = 0 
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Thus all these polynomials ¢ vanish at every point y, in the neigh- 
borhood of y=6, and therefore, by the case n=1 of our theorem, 
they are all identically-zero. From this it follows that for every 
value of x, f(a, y) vanishes for all values of y, that is f=0, and 
our theorem is proved. 

We leave to the reader the obvious extension of this method of 
proof to the case of n variables by the use of mathematical induction. 

From the theorem just proved we can infer at once the following: 


THEOREM 2. A necessary and sufficient condition that two polyno- 
mials in the variables (x,,---x,) be identically equal is that they be 
equal throughout the neighborhood of a point (a,,--+a,). 


EXERCISES 


1. Theorem 3, § 1 may be stated as follows: If f is a polynomial in one 
variable which is known not to be of degree higher than n, then if f vanishes at 
n-+ 1 distinct points, it vanishes identically. 

Establish the following generalization of this theorem : 

If fis a polynomial in (2, y) which is known not.to be of higher degree than 
nin z, and not of higher degree than m in y, then, if f vanishes at the 


(n+1) (m+1) distinct points: ( Peper yr 1) 


(Zi, yj) aa 
it vanishes identically. J=1,2,-- m+1 


2. Generalize the theorem of Exercise 1 to polynomials in any number of 
variables. 


3. Prove Theorem 4, § 2 by means of Theorem 1 of the present section; and 
from this result deduce Theorem 3, § 2. 


4. Do Theorems 1 and 2 of this section hold if we consider only real polyno- 
mials and the real neighborhoods of real points? 


4. Homogeneous Codrdinates. Though only two quantities are 
necessary in order to locate the position of a point in a plane, it is 
frequently more convenient to use three, the precise values of the 
quantities being of no consequence, but only their ratios. We will 
represent these three quantities by 2, y, ¢, and define their ratios by 


the equations &_y Y_¥. 
: t a b) t = ’ 


where X and Vare the cartesian codrdinates of a point in a plane. 
Thus (2, 3, 5) will represent the point whose abscissa is 2 and whose 
ordinate is 8. Any set of three numbers which are proportional to 
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(2, 3, 5) will represent the same point. So that, while to every 
set of three numbers (with certain exceptions to be noted below) 
there corresponds one and only one point, to each point there cor- 
respond an infinite number of different sets of three numbers, all 
of which, however, are proportional. 

When ¢=0 our definition is meaningless; but if we consider 
the points (2, 3, 1), (2, 3, 0.1), (2, 8, 0.01), (2, 3, 0.001), ---, which 
are, in cartesian codrdinates, the points (2, 3), (20, 30), (200, 300), 
(2000, 8000), --, we see that they all lie on the straight line through 
the origin whose slope is 3. Thus as ¢ approaches zero, w and y 
remaining fixed but not both zero, the point (a, y, t) moves away 
along a straight line through the origin whose slope is y/z. Hence 
it is natural to speak of (a, y, 0) as the point at infinity on the line 
whose slope is y/z. If ¢ approaches zero through negative values, 
the point will move off along the same line, but in the opposite 
direction. We will not distinguish between these two cases, but 
will speak of only one point at infinity on any particular line. It 
can be easily verified that if a point moves to infinity along any line 
parallel to the one just considered, its homogeneous codrdinates may 
be made to approach the same values (2, y, 0) as those just obtained. 
It is therefore natural to speak of the point at infinity in a cer- 
tain direction rather than on a definite line. Finally we will agree 
that two points at infinity whose codrdinates are proportional shall 
be regarded as coinciding, since these codrdinates may be regarded 
as the limits of the codrdinates of one and the same point which 
moves further and further off.* ; 

If = y=t=0, we will not say that we have a point at all, since 
the codrdinates of any point whatever may be taken as small as we 
please, and so (0, 0, 0) might be regarded as the limits of the codér- 
dinates of any fixed or variable point. 


* It should be noticed that in speaking of points at infinity we are, considering 
the matter from a purely logical point of view, doing exactly the same thing that we 
did in § 3 in speaking of imaginary points, or points in space of m dimensions ; that is, 
we are speaking of a set of quantities as a ‘point’? which are not the codrdinates of 
any point. The only difference between the two cases is that the coordinates of our 
‘‘point at infinity ’’? are the limits of the codrdinates of a true point. 

Thus, in particular, it is a pure convention, though a desirable and convenient 
one, when we say that two points at infinity shall be regarded as coincident when and 
only when their coordinates are proportional. We might, if we chose, regard all 
points at infinity as coincident. There is no logical compulsion in the matter. 
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The equation 
Axe BAY + OY 24+ DX+HY+ F=0 


becomes, in homogeneous coérdinates, 





2 2 
At 4B 4 Of "4 HL+F=0 
at ate eet at ,t# ) 
or Ax? + Bey + Cy? + Det + Hyt + Fe=0, 


a homogeneous equation of the second degree; and it is evident that 
if the codrdinates X, Y in any algebraic equation be replaced by 
the codrdinates 2, y, t, the resulting equation will be homogeneous, 
and of the same degree as the original equation. It is to this fact 
that the system owes its name, as well as one of its chief advantages. 

The equation ee By eae 0 
represents, in general, a line, but if A= B=0, (#0, it has no true 
geometric locus. It is, in this case, satisfied by the codrdinates of all 
points at infinity, and by the coérdinates of no other point. We shall 
therefore speak of it as the equation of the line at infinity. The reader 
may easily verify, by using the equation of a line in terms of its inter- 
cepts, that if a straight line move further and further away, its homo- 
geneous equation will approach more and more nearly the form t=0. 

In space of three dimensions we will represent the point whose 
cartesian codrdinates are X, Y, Z by the four homogeneous codrdi- 
nates x, y, 2, t, whose ratios are defined by the equations 

ee ae, 
t t t 
We will speak of (2, y, 2, 0) as “the point at infinity” on a line 
whose direction cosines are 
s ebed. eiee Z @ : 
VELP+E VeEtP+e VeP+ PHA 








(0, 0, 0, 0) will be excluded, and ¢ = 0 will be spoken of as the equa- 
tion of the plane at infinity. 

Extending the same terminology to the general case, we shall 
sometimes find it convenient to speak of (2, x,, --- #,) not as a point 
in space of n dimensions, but as a point represented by its homo- 
geneous codrdinates in space of »—1 dimensions, Two points 
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whose coordinates are proportional will be spoken of as identical, 
a point whose last codrdinate is zero will be spoken of as a point at 
infinity, and the case 2,=---=z,=0 will not be spoken of as a 
point at all. This terminology will be adopted only in connection 
with homogeneous polynomials, and even then it must be clearly 
understood that we are perfectly free to adopt whichever terminol- 
“ogy we find most convenient. Thus, for instance, if f(x, vg, 73) 1s 
a homogeneous polynomial of the second degree, the equation f= 0 
may be regarded either as determining a conic in a plane (2, 2%, X53 
being homogeneous coérdinates) or a quadric cone in space (2, 2, Xs 
being ordinary cartesian codrdinates). 

Homogeneous coérdinates may also be used in space of one 
dimension. We should then determine the points on a line by two 
coérdinates x,t whose ratio x/t is the non-homogeneous coérdinate 
X, t.e. the distance of the point from the origin. It is this repre- 
sentation that is commonly made use of in connection with the 
theory of binary forms. 


5. The Continuity of Polynomials. 
_ Derryition. A function f(a, --- 2) ts said to be continuous at the 
point (G++ ¢,) no matter how small a positive quantity ¢ be chosen, 
a neighborhood of the point (¢, ++. ¢,) can be found so small that the dif- 
Ference between the value of the function at'any point of this neighbor- 
hood and its value at the point (¢, +++ ¢,) ts in absolute value less than e. 

That is, f is continuous at (¢, --- ¢,) if, having chosen a positive 
quantity ¢, it is possible to determine a positive 6 such that 

| fen ce In) — J (Gy ae Cn) | <e€ 
for all values of (2, --- x,) which satisfy the inequalities, 
| 2, —¢,| <0, | at — ¢, | = 5, oH |Z, — ey, | <6. 

THEoREM 1. If two functions are continuous at a point, their 

sum is continuous at this point. 


Let f, and f, be two functions continuous at the point (¢, +++ ¢,) 
and let k, and k, be their respective values at this point. Then, no 
matter how small the positive quantity e may be chosen, we may 
take 6, and 6, so small that 

lA-kl<de when | 7 — ¢;| < 6, 
lfz—hgl<de when | #;—¢;| < 6. 
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Accordingly 
lit—-ml+lh—-hi<e when | 2;—¢| < 4, 
where 6 is the smaller of the two quantities 6, and 6,; and, since 
|A|+|B|2|A+B], we have 
Ai — Br + Fo — hel = (A, +52) — (ey + q)|<e when |a;— ¢;|< 6. 
Hence f, +f, is Parmar at the point (¢, *** ¢,). 


CoroLuARY. Tf a finite number of functions are continuous at a 
point, their sum is continuous at this point. 


THEOREM 2. Jf two functions are continuous at a point, their 
product is continuous at this point. 


Let f, and f, be the two functions, and k, and fk, their values at 
the point (¢, --- ¢,) where they are assumed to be continuous. We 
have to prove that however small e may be, 6 can be chosen so small 
that 


(1) tifa — hyhy| <e when |2,— ¢| <6. 


Let be a positive constant, which we shall ultimately restrict to a 
certain degree of smallness, and let us choose two positive constants 
oy and. oN such that ve Cw ky | <n when |x, <= ¢,| = 8; 

a = Nal <7 when |u; — 6,|< dy, 
Now take § as the smaller of the two quantities 6, and 6. Then, 
when | 2; =e e;| <a 8, 

Fifa — hey heal = aC fA, — Ay) + hy Pa — f)| 
S| fal At — Aa + al 2 — Fel S Bal + [A/S 


Accordingly since, when |x; — ¢;|< 6, 
| Fa) = | ho + (Fo — he)| < |Fa| + | fg — ha] <|Fel +0, 


we may write 
(2) Fifa — Hy hal < {| y] + [Aal3 + 


If k, and k, are not both zero, let us take » small enough to satisfy 


the two inequalities , 3 
5. PN 
7 Bley + [al 2 
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If k, =k, =0, we will restrict » merely by the inequality 
n< ve. 


In either case, inequality (2) then reduces to the form (1) and our 
theorem is proved. 


CoroLtuARY. Jf a finite number of functions are continuous at a 
point, their product is continuous at this point. 


Referring now to our definition of continuity, we see that any 
constant may be regarded as a continuous function of (x, --+ w,) for 
all values of these variables, and that the same is true of any one of 
these variables themselves. Hence by the last corollary any function 
of the form Cxh eat ain, where the &’s are integers positive or zero, is 
continuous at every point. If we now refer to the corollary to 
Theorem 1, we arrive at the theorem: 


THEOREM 3. <Any polynomial is a continuous function for all 
values of the variables. 


Finally, we give a simple application of this theorem. 


THEOREM 4. Jf f(2,, --- x,) ts a polynomial and f(y, --- c,) #0, 
it is possible to take a neighborhood of the point (¢, --- ¢,) so small that 
J does not vanish at any point in this neighborhood. 


Let k=f(¢,, ++ ¢,). Then, on account of the continuity of f at 


(cy, -++ €,), a positive quantity 6 ean be chosen so small that through- 
out the neighborhood |x; — ¢;| <8, the inequality 


If—h| <3 Ik 


is satisfied. In this neighborhood f cannot vanish; for at any point 
where it vanished we should have 


| Pb = |< 31k 
which is impossible since by hypothesis k+ 0. 


6. The Fundamental Theorem of Algebra. Up to this point no 
use has been made of what is often known as the fundamental 
theorem of algebra, namely the proposition that every algebraic 


equation has a root. This fact we may state in more precise form 
as follows : 
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THEOREM 1. Jf f(x) is a polynomial of the nth degree where 
n= 1, there exists at least one value of x for which f(x) = 0. 


This theorem, fundaniental though it is, is not necessary for 
most of the developments in this book. Moreover, the methods of 
proving the theorem are essentially not algebraic, or only in part 
algebraic. Accordingly, we will give no proof of the theorem here, 
but merely refer the reader who desires a formal proof to any of the 
text-books on the theory of functions of a complex variable. We 
shall, however, when we find it convenient to do so, assume the 
truth of this theorem. In this section we will deduce a few of its 
more immediate consequences. 


THEOREM 2. If f(x) is a polynomial of the nth degree, 
F(@) = aya” + aya” os Hy ye + dy (a) #9), 


there exists one and only one set of constants, 01, cy, +++ On, such that 
F(X) = Ay(v — %)(@ — Oy) +++ (@ — Oy) 


This theorem is seen at once to be true for polynomials of the 
first degree. Let us then use the method of mathematical induction 
and assume the proposition true for all polynomials of degree less 
than n. If we can infer that the theorem is true for polynomials of 
the nth degree, it follows that being true for those of the first degree 
it is true for those of the second, hence for those of the third, etc. 

By Theorem 1 we see that there is at least one value of # for 
which f(z)=0. Call such a value a, By Theorem 1, § 1 we may 


write 
S(2) = (v— %) (2), 
where p(x) =a 2" 1 + bar + oe +b, 
Since $(z) is a polynomial of degree n —1, and since we are assum- 


ing our theorem to be true for all such polynomials, there exist 
n —1 constants @,, +--+ «, such that 


h(x) = a(x — Gy) +++ (4 — Oy) 
Hence F(x) = 4y(@ — a) (@ — ay) +++ (@ — Oy) 


Thus half of our theorem is proved. 
Cc 
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Suppose now there were two such sets of constants, a, +++ %, and 
By, + 8, We should then have 


(1) f(x) = ag(v — a) +++ (@ = thy) = g(a — By) + (@— Pr)- 
Let « =a, in this identity. This gives 
dt (oy — By)(ay — Ba) *** (4 — Bn) = 9- 


Accordingly, since a,#90, «, must be equal to one of the quan- 
tities B,, By, -** RB, Let us suppose the §’s to have been taken in | 
such an order that «,=,. Now in the identity (1) cancel out the © 
factor a,(—«,) (see Theorem 4, § 2). This gives 


(@ ey Oty) as (x = Pe) ae P2) ae (a Bs)» 


Accordingly, since we have assumed the theorem we are proving — 
to be true for polynomials of degree n— 1, the constants A,, --- 8, 
are the same, except perhaps for the order, as the constants @,, +--+ @,, 
and our theorem is proved. 


DEFINITION. The constants 0, +++ «, determined in the last theo- 
rem are called the roots of the polynomial f(a), or of the equation 
f(x)=9. If k of these roots are equal to one another, but different 
from all the other roots, this root is called a k-fold root. 

It is at once seen by reference to Theorem 1, § 1 that these roots 
are the only points at which f(x) vanishes. 


THEOREM 3. Jf f(a, + %)ts a polynomial which is not identt- 
cally equal to a constant, there are an infinite number of points (ay, +++ Lp) 
at which f#0, and also an infinite number at which f=9, provided 
n>. 

The truth of the first part of this theorem is at once obvious, for, 
since f is not identically zero, a point can be found at which it is not 
zero, and then a neighborhood of this point can be taken so small 
that f does not vanish in this neighborhood (Theorem 4, § 5). This 
neighborhood, of course, consists of an infinite number of points. 

To prove that f vanishes at an infinite pumber of points, let us 
select one of the variables which enters into f to at least the first 
degree. Without loss of generality we may suppose this variable 
to be 2; We may then write 


Say Ly == F(a “++ Dy) ah Fy (@o, oa Ot ee + F,,( 22, cae Lqy)s 


POLYNOMIALS AND THEIR FUNDAMENTAL PROPERTIES 19 


where k>1 and F) is not identically zero. Let (¢,, ++ ¢,) be any 
point at which /is not zero. Then f(a, ¢, «++ ¢,) is a polynomial 
of the kth degree in 2, alone. Accordingly, by Theorem 1, there is 
at least one value of 2, for which it vanishes. If c, is such a value, 
F (C45 Cg, *** Cn) = 0. Moreover, by the part of our theorem already 
proved, there are an infinite number of points where F, + 0, that 
is an infinite number of choices possible for the quantities ¢,, --- ¢. 
Thus our theorem is completely proved. 

Finally, we will state, without proof, for future reference, a 
theorem which says, in brief, that the roots of an algebraic equation 
are continuous functions of the coefficients : 


THEOREM 4. Jf «ais a root of the polynomial 


* 
Ay” + aU" 1+ + + dy E+ Ay (Oe * 
9 


then no matter how small a neighborhood |x — «a|<e of the point « we 
may consider, it is possible to take in space of n+ 1 dimensions a neigh- 
borhood of the point (ay,44, «++ an) so small that, tf (bo, 54, «+: 6,) 18 any 
point in this neighborhood, the polynomial 

6,0" + 6,0" 1+---+5,_.24 6, 
has at least one root B in the neighborhood |x — a|<e of the point a. 


For a proof of this theorem we refer to Weber’s Algebra, 
Vol. 1, § 44. 


* The theorem remains true if we merely assume that the polynomial is of at least 
the first degree. That is, some of the first coefficients a, a1, --- may be zero. 


CHAPTER II 
A FEW PROPERTIES OF DETERMINANTS 


7. Some Definitions. We assume that the reader is familiar with 
the determinant notation, and will merely recall to him that by a 
determinant of the nth order 


igs ein 
M1 %2 °°" Aon 
Any Ang see Ann 


we understand a certain homogeneous polynomial of the nth degree 
in the n? elements a, By the side of these determinants it is often 
desirable to consider the system of the n? elements arranged in the 
order in which they stand in the determinant, but not combined 
into a polynomial. Such a square array of n? elements we speak 
of as a matriz. In fact, we will lay down the following somewhat 
more general definition of this term: 


DEFINITION 1. A system of mn quantities arranged in a rectangu- 
lar array of m rows and n columns is called a matrix. If m=n, we 
say that we have a square matrix of order n. 


It is customary to place double bars on each side of this array, 
thus : 





M1 Ap * Un 
Oey Ugg? “Soe 
Any Ang irs Onn 


20 


A FEW PROPERTIES OF DETERMINANTS 21 


Sometimes parentheses are used, thus: 


My Ap ** An 
M1 U2 *** on 


Unt Ang, Ann 


Even when a matrix is square, it must be carefully noticed that 
it is not a determinant. In fact, a matrix is not a quantity at all,* 
but a system of quantities. This difference between a square ma- 
trix and a determinant is clearly brought out if we consider the 
effect of interchanging columns and rows. This interchange has no 
effect on a determinant, but gives us a wholly new matrix. In fact, 
we will lay down the definition: 


DEFINITION 2. Two square matrices 


Nt toe in City Ss blot 


Any *** Ann Bin *"* Ann 


of which either is obtained from the other by interchanging rows and 
columns are called conjugate + to each other. 


Although, as we have pointed out, square matrices and deter- 
minants are wholly different things, every determinant determines a 
square matrix, the matrix of the determinant, and conversely every 
square matrix determines a determinant, the determinant of the 
matrix. 

Every matrix contains other matrices obtained from it by strik- 
ing out certain rows or columns or both. In particular it contains 
certain square matrices; and the determinants of these square 
matrices we will call the determinants of the matrix. If the matrix 
contains m rows and n columns, it will contain determinants of all 
orders from 1 (the elements themselves) to the smaller of the two 
integers and m and n inclusive.t In many important problems all 


* Cf., however, § 21. + Sometimes also transposed. 
t If m=n, there is only one of these determinants of highest order, and it was this 
which we called above the determinant of the square matrix. 
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of these determinants above a certain order are zero, and it is often 
of great importance to specify the order of the highest non-vanish- 
ing determinant of a given matrix. For this purpose we lay down 
the following definition : 


DEFINITION 3. A matrix is said to be of rank r if it contains at 
least one r-rowed determinant which is not zero, while all determinants 
of order higher than r which the matrix may contain are zero. 

A matrix is said to be of rank 0 of all its elements are zero. 


For brevity, we shall speak also of the rank of a determinant, 
meaning thereby the rank of the matrix of the determinant. 

We turn now to certain definitions concerning the minors of 
determinants; that is, the determinants obtained from the given 
determinant by striking out certain rows and columns. 

It is a familiar fact that to every element of a determinant 

corresponds a certain first minor; namely, the one obtained by 
striking out the row and column of the determinant in which 
the given element lies. Now the elements of a determinant 
of the nth order. may be regarded as its (n—1)th minors. 
Accordingly we have here a method of- pairing off each one- 
rowed minor of a given determinant with one of its (n—1)-rowed 
minors. 
Similarly, if MZ is a two-rowed minor of a determinant of the 
nth order D, we may pair it off against the (m — 2)-rowed minor V 
obtained by striking out from D the two rows and columns which 
are represented in M: The two minors M@ and WV we will speak of 
as complementary. ‘Thus, in the determinant 


G11 M9 Wg Oi, “Se 

Mo1 Ag yg gg Ags 

M1 Aq Ugg gg 9 Ags |, 

My Asn Aggy Ugg Uys 

G1 Ag sg Ws es 
the two minors 


oy %%3I, 








431 Ugg 


are complementary. 
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In the same way we pair off with every three-rowed minor an 
(n— 3)-rowed minor; etc. In general we lay down’ 


DeEFINITION 4. Jf D is a determinant of the nth order and M 
one of its k-rowed minors, then the (n—k)-rowed minor N obtained by 
striking out from D all the rows and columns represented in M is called 
the complement of M. 


Conversely, Mis clearly the complement of WV. 

Let us go back now for a moment to the case of the one-rowed 
minors; that is to the elements themselves. Let a, be the element of 
the determinant D which stands in the 7th row and the jth column. 
Let D,; represent the corresponding first minor. It will be recalled 
that we frequently have occasion to consider not this minor D, but 
the cofactor A; of a; defined by the equation A, =(—1)'*/Dy. 

Similarly, it is often convenient to consider not the complement 
of a given minor but its algebraic complement, which in the case just 
mentioned reduces to the cofactor, and which, in general, we define 
as follows: 


Derrmition 5. If M is the m-rowed minor of D in which the rows 
ky, -+- tm, and the columns |, --- l, are represented, then the algebraic com- 
plement of M is defined by the equation 


ale. compl. of M=(—1)htt*=tht+/[ compl. of HM]. 


The following special case is important: 


DEFINITION 6. By a principal minor of a determinant D is under- 
stood a minor obtained by striking out from D the same rows as columns. 


Since in this case, using the notation of Definition 5, we have 
key + + kim = Lt tly 


it follows that the algebraic complement of any principal minor is equal 
to its plain complement. 

We have so far assumed tacitly that the orders of the minors we 
were dealing with were less than the order n of the determinant 
itself. By the n-rowed minor of a determinant D of the mth order 
we of course understand this determinant itself. The complement 
_of this minor has, however, by our previous definition no meaning. 
We will define the complement in this case to be 1, and, by Definition 5, 
this will also be the algebraic complement. 
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EXERCISE 


Prove that, if 4 and N are complementary minors, either M and N are the 
algebraic complements of each other, or — N is the algebraic complement of 
M and — M is the algebraic complement of NV. 


8. Laplace’s Development. Just as the elements of any row or 
column and their corresponding cofactors may be used to develop a 
determinant in terms of determinants of lower orders, so the k-rowed 
minors formed from any k rows or columns may be used, along with 
their algebraic complements, to obtain a more general development 
of the determinant, due to Laplace, and which includes as a special 
case the one just referred to. In order to establish this develop- 
ment, we begin with the following preliminary theorem : 


THEOREM 1. Jf the rows and columns of a determinant D be 
shifted in such a way as to bring a certain minor M into the upper left- 
hand corner without changing the order of the rows and columns either 
of M or of its complement N, then this shifting will change the sign of 
D or leave it unchanged according as — Nor Nis the algebraic com- 
plement of M. 


To prove this let us, as usual, number the rows and columns of 
D, beginning at the upper left-hand corner, and let the numbers of 
the rows and columus represented in M, arranged in order of increas- 
ing magnitude, be £,,---k,,, and l,, ---l, respectively. In order to 
effect the rearrangement mentioned in the theorem, we may first 
shift the row numbered k, upward into the first position, thus carry- 
ing it over k, —1 other rows and therefore changing the sign of the 
determinant k,—1 times. Then shift the row numbered /, into the 
second position. This carries it over k, — 2 rows and hence changes 
the sign k, —2 times. Proceed in this way until the row numbered 
k,, has been shifted into the mth position. ‘Then shift the columns 
ina similar manner. ‘The final result is to multiply D by 


(— Hyper eh s+ hlm—2(1+2+4 +m) —. (— Lire tkm+lt +m 


Comparing this with Definition 5, § 7, the truth of our theorem is 
obvious. 


LemMA. Jf Mis a minor of a determinant D, the product of M 
by its algebraic complement is tdentical, when expanded, with some of 
the terms of the expansion of D. 
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Gy Fin 
Let D= 5 
Any °° Ann 


and call the order of M, m, and its complement V. We will first 
prove our lemma in the special case in which YW stands in the upper 
left-hand corner of D, so that WV, which in this case is the algebraic 
complement, is in the lower right-hand corner. What we have to 
show here is that the product of any term of Mby a term of MW is a 
term of D, and that this term does not come in twice to the product 
MN. Any term of M may be written 


( ir 1)tay, Gy, os Uni? 


where the integers J,, /,,---/, are merely some arrangement of the 
integers 1, 2, ---m, and mw is the number of inversions of order in this 
arrangement. Similarly, any term of VV may be written 


Cj Laney, lin-+1 Un-+2, Im+2 ne An, 1? 


where ,,.,, -*: J, is merely some arrangement of the integers m+ 1, 
--- m, and v is the number of inversions of order in this arrangement. 
The product of these two terms 


( _— af yay, AsIy eee Ani,» 


is a term of D, for the factors a are chosen in succession from the 
first, second, --- nth rows of D, and no two are from the same col- 
umn, and «+ v is clearly precisely the number of inversions of order 
in the arrangement J,, /,, --- J,,as compared to the natural _arrange- 
ment, 1, 2, --- n, of these integers. 

Having thus proved our lemma in the special case in which M 
lies in the upper left-hand corner of D, we now pass to the general 
case. Here we may, by shifting rows and columns, bring MW into 
the upper left-hand corner and W into the lower right-hand corner. 
This has, by Theorem 1, the effect of leaving each term in the 
expansion of D unchanged, or of reversing the sign of all of them 
according as V or — Wis the algebraic complement of M. Accord- 
ingly, since the product MW gives, as we have just seen, terms in 
the expansion of this rearranged determinant, the product of WM by 
its algebraic complement gives terms in the expansion of D itself, 
as was to be proved. 
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Laplace’s Development, which may be stated in the form of the 
following rule, now follows at once: 


THEOREM 2. Pick out any m rows (or columns) from a determt- 
nant D, and form all the m-rowed determinants from this matrix. The 
sum of the products of each of these minors by its algebraic complement 

is the value of D. 


Since, by our lemma, each of these products when developed con- 
sists of terms of D, it remains merely to show that every term of 
D occurs in one and only one of these products. This is obviously 
the case; for every term of D contains one element from each of the 
m rows of D from which our theorem directs us to pick out m-rowed 
determinants, and, since these elements all lie in different columns, 
they lie in one and only one of these m-rowed determinants, say MV. 
Since the other elements in this term of D obviously all lie in the 
complement WV of WM, this term will be found in the product MN and 
in none of the other products mentioned in our theorem. 


EXERCISES 
1. From a square matrix of order n and rank r, s rows (or columns) are selected. 
Prove that the rank of the matrix thus obtained cannot be less than r+s—n. 
2. Generalize the theorem of Exercise 1. 


9. The Multiplication Theorem. Laplace’s Development enables 
us to write out at once the product of any two determinants as a 
single determinant whose order is the sum of the orders of the two 


given determinants : 
ayy eee Ain 0 eee 0 : 


Git" Gin MBq tba On, *** Onn 0+» 0 


nn 
|= | Part Pin 94, ++ On|, 


mm 


Any *** Any Ong ==) 





Pm *** Pmn On ie Ob 


whatever the values of the p’s may be. For, expanding the large 
determinant in terms of the n-rowed minors of the first n rows, al 


the terms of the expansion are zero except the one written in th 
first member of the equation. 
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From this formula we will now deduce a far more important 
one for expressing the product of two determinants of the same 
order as a determinant of that order. For this purpose let us choose 
the p’s in the last formula as follows: 


wheni#j, p,=—1, 


Py=9 


and let us consider for simplicity the product of two determinants 
of the third order. We have 


ay ho ae 0 0 0 
Cmca 8 a a, as B 1 By Bs 0 0 0 
Pee ee.) 1. bc oideah es OF 0a 0: 
i ¥2 3 Cy Co C3 am 0 0 ay On) as 

0 —1 Osborn. 204 


0 Ol. Reece, ac, 

Let us now reduce this six-rowed determinant by multiplying 
its first column by a, and adding it to the fourth column; then 
multiply the first column by a, and add it to the fifth; then 
multiply the first column by a, and add it to the sixth. In this 
way we bring zeros into the last three places in the fourth row. 
Next multiply the second column successively by 6,, 0,, 6, and 
add it to the fourth, fifth, and sixth columns respectively. 
Finally multiply the third column successively by ¢,, ¢, c, and 
add it to the fourth, fifth, and sixth columns. The determinant 
thus takes the form 


ty Ag+ Onda + Og Cy 0 Agt Gaby + tg cg 


hy Cg eg hy Ay + Oy by +c, 

By By Bs Byay+Byb,+83e By 4, +8yb,+ Rely By4g+ Bobs + Beeg 

YM V2 Ys 1% +%2%1+%% 14+ abot 3% 143+ 253+ ¥8%s), 
ste > 0 0 0 0 

0-1 0 0 0 0 

0 0-1 0 0 0 


and this reduces at once to the three-rowed determinant 


Oty iy + indy + Mg cy 
By, + Bobs + Bae 
14, + Y2by + 201 


4 Ay + Oo by + Ugly 
Bq + Bybg + Bs eq 
1144q + 209 + Y3% 


04 Ag + Ogbg + Oy cy 
B43 + Boog + Bgeg|’ 
1143 + Yq53 + 30s 
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We have thus expressed the product of two determinants of the 
third order as a single determinant of the third order. The method 
we have used is readily seen to be entirely general, and we thus get 
the following rule for multiplying together two determinants of the 
nth order : 


THEOREM. The product of two determinants of the nth order 
may be expressed as a determinant of the nth order in which the 
element which lies in the ith row and jth column is obtained by 
multiplying each element of the ith row of the first factor by the 
corresponding element of the jth column of the second factor and 
adding the results. 


It should be noted that changing rows into columns in either or 
both of the given determinants, while not affecting the value of the 
product, will alter its form materially. For example, 


20 41 


Paes sr LY: 

4 5 ‘|e l= {oe 73 = 
28) | 6/28" 30K e, 
4S |T Ol” 180.80) 2 
2 4) (17/126 SOL 
8.5) 16) 9] 4\385 Gehe 
2 4) ]1 6) 180 48) oe 
8. B17 $9) (88 Gaps 


and similarly the product of any two determinants of the same order 
may be written in four different forms. 


10. Bordered Determinants. If to a determinant of the nth 
order we add one or more rows and the same number of columns of 
m quantities each and fill in the vacant corner with zeros, the result- 
ing determinant is called a bordered determinant. Thus starting from 
the two-rowed determinant 


a B 
y 6 
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we may form the bordered determinants 


ld " 


U, uu 
eB uy wu ESE TO a 
ets: uy] y 8 Uy Uy wy 
"0 tae ce 
y 8 Ul, if el eer WO atatr 
me 0 ye aan wee, 0) 0). 0 
if 
Ahi ae AU vo! of 0 0 0 


If in the second of these examples we use Laplace’s Development 
to expand the bordered determinant according to the two-rowed 
determinants of the last two rows, we see that its value is 


I 
Uy Uy 
! 
Uy Uh, 


V1 1% 


I ae 
Wy % 














a quantity into which the elements «, 8, y, 6 of the original deter- 
minant do not enter. Similarly expanding the third of the above 
bordered determinants according to the three-rowed determinants of 
its last three rows, we see that its value is zero. 

The reasoning we have here used is of general application and 
leads to the following results: 


THEOREM 1. Jf a determinant of the nth order is bordered with 
n rows and n columns, the resulting determinant has a value which 
depends only on the bordering quantities. 


THEOREM 2. If a determinant of the nth order is bordered with 
more than n rows and columns, the resulting determinant always has the 
value zero. 


The cases of interest are therefore those in which the deter- 
minant is bordered with less than m rows and columns. Concerning 
these we will establish the following fact: 


TuerorEeM 3. Jf a determinant of the nth order be bordered by p 
rows and p columns (p <n) of independent variables, the resulting 
determinant is a polynomial of degree 2p in the bordering quantities, 
whose coefficients are the pth minors of the original determinant ; and 
conversely, every pth minor of the original determinant is the coefficient 
of at least one term of this polynomial. 
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Let us consider the special case where n = 4 and p=. 


Cee Ade eg 
YM MeO 
Developing this determinant, by Lapl 
of the two-rowed determinants of the 
M13 Ag Uy 


Mg Aeq Ug 








3g Ag, Ug 
4g Ugg Uy 


If now we expand each of these 


Uy Uy 
Uy Up 
Ug Ue 
Uy Uy ; 
0 0 
eas) 


ace’s method (§ 8), in terms 
last two rows, we have 

uy 

ul. 
2/4... to 6 terms. 

| 

Us| - 


uh | 


four-rowed determinants, by 


Laplace’s method, in terms of the two-rowed determinants of their 
last two columns, and then arrange the result as a polynomial in 


the w’s and v’s, the truth of the theorem is apparent. 


to the reader to fill in the details of the 


We leave it 
proof here sketched. 


11. Adjoint Determinants and their Minors. 


DEFINITION. Jf, in the determinant 


Wa oe ois 


Aj; is the cofactor of the element a;;, then the determinant 


Ay, a Aj, 


Dia!" 


Any a ye 


is called the adjoint of D. 
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By corresponding minors of D and D’, or indeed of any two 
determinants of the same order, we shall naturally understand 
minors obtained by striking out the same rows and columns from 
Das from D'. Thesé definitions being premised, the fundamental 

theorem here is the following: 


THEOREM. Jf D! is the adjoint of any determinant D, and M and 
M’' are corresponding m-rowed minors of D and D' respectively, then 
M' is equal to the product of D™™ by the algebraic complement of M. 


We will prove this theorem first for the special case in which 
the minors Mand M’ lie at the upper left-hand corners of D and D! 
respectively. We may then write 


eee tee %.. Ais 
Any Aim Ann 
M =10 rent at 6 0 
0 On Ost 0 


0 cae. 0s ieies cas). 
Let us now interchange the columns and rows of D, 
44 aoe Any 
Da tml Sl. 
An "** Ann 


and then form the product M’D by the theorem of § 9. This gives 


D 0 ant) 0 eG 
0 D a) 0 ie AG 
MD=\0 0 Pes, 0 
A, m+1 A, m+1 °°° Am, m+1 Amti,m+1 °°" an, m+1 
Ain Aon ses Amn Am+1, see Ann 
Amn+i,mt+1 °° An, m+1 


= D™)* 


An+1, n see Ann 
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Let us here regard a4, **+ Gp, a8 n* independent variables. 
Then the equation just written becomes an identity, from which D, 
since it is not identically zero, may be cancelled out, and we get 


Amtipmt1 °°" %,m+1 
(1) Mw =D" 
Unian + Onn 


Since the determinant which is written out in (1) is precisely the 
algebraic complement of M, our theorem is proved in the special 
case we have been considering. It should be noticed that this proof 
holds even in the case m=n; cf. Corollary 2 below. 

Turning now to the case in which the minors Mand M’ do not 
lie at the upper left-hand corners of Dand D’, let us denote by a the 
sum of the numbers which specify the location of the rows and 
columns in M or M', the numbering running, as usual, from the up- 
per left-hand corner. Then by Definition 5, § 7, 


(2) alg. compl. of M=(—1)* [compl. of 7]. 


Let us now, by shifting rows and columns, bring the determinant 
M into the upper left-hand corner of D. Calling the determinant 
D, as thus rearranged, D,, we have (cf. Theorem 1, § 8) 


(3) D,=(—-1)D. 


The cofactors in D, are equal to (— 1)*A,, since the interchange of 
two adjacent rows or columns of a determinant changes the sign of 
‘every one of its cofactors. Accordingly the adjoint of D,, which we 
will call Dj, may be obtained from D! by rearranging its rows and 
columns in the same way as the rows and columns of D were 
rearranged to give D,, and then prefixing the factor (—1)* to each 
element. 

Let us now apply the special case already established of our 
theorem to the determinant D, and its adjoint D/, the m-rowed 
minors M, and M}, being those which are situated in the upper left- 
hand corner of D, and Dj respectively. We thus get 


(4) M',= Dj" [alg. compl. of M,]. 
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Now, since M, is a principal minor, its algebraic complement is 
the same as its ordinary complement, and this in turn is the same as 
the ordinary complement of the minor Min D. Accordingly, using 
(2), we may write 


(5) alg. compl. of M, =(—1)* [alg. compl. of W]. 


Since the elements of MW} differ from those of M’ only in having 
the factor (— 1)" prefixed to each, it follows that 


(6) Mi =(—1)"M". 
We may now reduce (4) by means of (3), (5), and (6). We thus 


Bet (yma! = (— 1)" D"-( — 1)" [alg. compl. of M]. 


Cancelling out the factor (—1)"* from both sides of this equation, 
we see that our theorem is proved. 

We proceed now to point out a number of special cases of this 
theorem which are worth noting on account of their frequent occur- 
rence. 


CoroLuARyY 1. Jf a; is any element of a determinant D of the 


nth order, and if a; is the cofactor of the corresponding element A, in 
the adjoint of D, then 


— —2 


This is merely the special case of our general theorem in which 
m=n—1, modified, however, slightly in statement by the use of 
the cofactor «, in place of the (n — 1)-rowed minor (— 1)" a. 


CoRoLLARY 2. If Dis any determinant of the nth order and D! 
its adjoint, then Da pe 


This is the special case m= n. 


CoroLLARy 3. Jf D is any determinant, and S is the second 
minor obtained from it by striking out its ith and kth rows and its jth 
and Ith columns, and if we denote by A,; the cofactor of the element 
which stands in the ith row and the jth column of D, then 


LE 
Aj; Ay 


This is the special case m = 2. 


= (— Ly EDS, 
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CHAPTER III 
THE THEORY OF LINEAR DEPENDENCE 


12. Definitions and Preliminary Theorems. Two sets of con- 
stants (a,, 5,, ¢,, dy,) and (a, by, ¢, d,) are usually said to be propor- 
tional to one another if every element of one set may be obtained 
from the corresponding element of the other by multiplying by the 
same constant factor. For example, (1, 2, 8, 4) and (2, 4, 6, 8) are 
proportional. It is ordinarily assumed that ecther set may be thus 
obtained from the other, and in most cases this is true; but in the 
case of the two sets (1, 2, 3, 4) and (0, 0, 0, 0) we can pass from 
the first to the second by multiplying by 0, but we cannot pass from 
the second to the first. 

A more convenient definition, for many purposes, and one which 
is easily seen to be equivalent to the above-mentioned one, is the 
following : 


DEFINITION 1. The two sets of constants 


Ag ! 
Ly Loy °** Lng 
tt " 
Ly; Lo) see Cay 


are said to be proportional to each other tf two constants c, and cy, not 
both zero, exist such that 


ce, 2+ ea! = 0 (@= 1, 2, --- ). 
If c, #0, we have 


Pe eg oy ey a ie Pee Sm 
Pre Oe ee Toy B= — Syl, 


and if c.+0, we have 


Cc Cc 
[fee tbe Mies oo il ' 
a el OL el ao 
2 2 2) 
The two sets of constants ae ! 
Bi, Bos robe 
0,9; 0, 
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are evidently proportional, since if we take c,=0 and ¢,= any con- 
stant not zero, we have a pair of ¢’s which fulfill the requirements 
of our definition. ° 

Linear dependence may be regarded as a generalization of the 
conception of proportionality. Instead of two sets of constants we 
now consider m sets, and give the following : 


DEFINITION 2. The m sets of n constants each, 


af, xf, ea ald G= 1g a re M), 


are said to be linearly dependent if m constants ¢1, Cy) +++ Cm, not all zero, 
exist such that 


eu; + egal +.) + ea") = 0 G=1, 2, + 1). 


If this is not the case, the sets of quantities are said to be linearly 
independent. 

In the same way we generalize the familiar conception of the 
proportionality of two polynomials as follows: 


DEFINITION 3. Them polynomials (in any number of independent 
variables) f,, fo, «+ fm are said to be linearly dependent if m constants 
Cy) Cy *** Cm, not all zero, exist such that 


ay ap Cfo + pcuaate Cn tin 0. 


Tf this is not the case, the polynomials are said to be linearly inde- 
pendent.* 

The following theorems about linear dependence, while almost 
self-evident, are of sufficient importance to deserve explicit state- 
ment : 

THEOREM 1. If m sets of constants (or if m polynomials) are lin- 
early dependent, tt is always possible to express one — but not necéssarily 
any one — of them linearly in terms of the others. This set of constants 
(or this polynomial) is then said to be linearly dependent on the others. 

This is seen at once if we remember that at least one of the e’s 
is not zero. The relations (or relation) in which the ¢’s occur can, 
then, be divided through by this e. 


* We might clearly go farther and consider the linear dependence of m sets of n 
polynomials each. The two cases of the text would be merely special cases from this 
general point of view. 
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Turorem 2. If there exist among the sets of constants (or among the 
polynomials) a smaller number of sets (or of polynomials) which are linearly 
dependent, then the m sets (or the m polynomials) are linearly dependent. 


For suppose there are J sets of constants (or J polynomials) 
which are linearly dependent (1<m), then we may take for our set 
of m e’s, the 1 e’s which must exist for the 7 sets(or polynomials) and 
(m—1) zeros 


THEOREM 3. Jf any one of the m sets of constants consists exelu- 
sively of zeros (or if any one of the polynomials is identically zero), the 
m sets (or the m polynomials) are linearly dependent. 

For we may take for the ¢ corresponding to this particular set 
(or polynomial) any constant whatever, except zero, and for the other 
(m —1) e’s, (m — 1) zeros. 


13. The Condition for Linear Dependence of Sets of Constants. 
In considering m sets of n constants each, 


(1) gi), oli), ano gl] (t= ik De sins mM), 


it will be convenient to distinguish between the two cases m <n and 
m> nN. 

(4) m<n. We wish here to prove the following fundamental 
theorem : 


THEOREM 1. A necessary and sufficient condition for the linear 
dependence of the m sets (1) of n constants each, when m<n, is that all 
the m-rowed determinants of the matria 


TY A 

wl ae ae eee 
lm] [m] m 

xy Ly 200 oe ] 





should vanish. 


That this is a necessary condition is at once obvious; for if the 
m sets of constants are linearly dependent, one of the rows can be 
expressed as a linear combination of the others. Accordingly if in 
any of the m-rowed determinants we subtract from the elements of 
this row the corresponding elements of the other rows after each row 
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has been multiplied by a suitable constant, the elements of this row 
will reduce to zero. The determinant therefore vanishes. 

We come now tothe proof that the vanishing of these deter- 
minants is also a sufficient condition. We assume, therefore, that 
all the m-rowed determinants of the above matrix vanish. Let us 
also assume that the rank of the matrix is r>0* (cf. Definition 3, 
§ 7). Without any real loss of generality we may (and will) assume 
that the r-rowed determinant which stands in the upper left-hand corner 
of the matrix does not vanish ; for by changing the order of the sets 
of constants and the order of the constants in each set (and these 
orders are clearly quite immaterial) we can bring one of the non- 
vanishing 7-rowed determinants into this position. 

We will now prove that the first (r+ 1) sets of constants are lin- 
early dependent. From this the linear dependence of the m sets 
follows by Theorem 2, § 12. 

Let us denote by ¢,, ¢, --- ¢,4, the cofactors in the (r+ 1)-rowed 
determinant which stands in the upper left-hand corner of the matrix, 
and which correspond to the elements of its last column. If we remem- 
ber that all the (r+1)-rowed determinants vanish, we get the relations 

eater +--+, aI = 0 (g=rt+1,r4+2, --- n). 

Since the sum of the products of the elements of any column of a 
determinant by the cofactors of the corresponding elements of another 
column is zero, this equation is also true when 7 = 1, 2, --. r. 

This establishes the linear dependence of the first (r +1) sets of 
constants, since ¢,,,, being the r-rowed determinant which stands 
in the upper left-hand corner of the matrix, is not zero. 

(6)m>n. This case can be reduced to the one already considered 
by the following simple device. Add to each set of n constants m—n 
zeros. Wethen have m sets of m constants each. Their matrix con- 
tains only one m-rowed determinant, and this vanishes since one, at 
least, of its columns is composed of zeros. Therefore these m sets of m 
constants each are linearly dependent ; and hence the original m sets 
of n constants each were linearly dependent. Thus we get the theorem: 


THEOREM 2. 1m sets of n constants each are always linearly depen- 
dent if m>n. 
* In general we shall have r= m— 1, but r may have any value less than m. The 


' only case which we here exclude is that in which all the elements of the matrix are 
zero, a case in which the linear dependence is at once obvious. 
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EXERCISES 


Determine ‘whether the following sets of constants are linearly 


dependent or not : Sag eb teee 6d, 
ble. a, 0,- — ¢, 4d, 
0, = 5 0, —3d. 
Ne OR Xbb ay 
2 ib, Zs Gs ib 
Da ae ag (5 
5, 2 1, 3, 4; 
3. O28; 07 053, 
15h Om One 


roe dares 
Pal 

iS So SY 
| 

on) 
| 

Se Coe 
he al 
noo ee 


14. The Linear Dependence of Polynomials. Suppose we have m 
polynomials, EPS peck fe 


in any number of independent variables. A necessary and sufficient 
condition for the linear dependence of these polynomials is evidently 
the linear dependence of their m sets of coefficients. Thus the condi- 
tions deduced in the last section can be applied at once to the case of 
polynomials. 

EXERCISES 


Determine whether the following polynomials are linearly depend- 
ent or not: 16x + 802, 


1. 6a+2y+ 5z- 4, 
lba+9y = 1s 


32, +42,-—427,+ 62y 

Wa, +3%,+ 72% 

22, — 2 — 3, 
—52,+92,-— a,+ 42,4 8. 


2e7+ Say+ 6y2 +1474 12y — 4, 
(ae + y+ G6a-— 4y, 

3227— G6Gay+ 3y?— 5a + 7, 
52? + 20 xy + 15 y? + 3852 4 30y — 10. 
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15. Geometric Illustrations. The sets of n constants with which 
we had to deal in §§ 12, 13 may, provided that not all the constants 
in any one set are zero, advantageously be regarded as the homoge- 
neous codrdinates of points in space of n — 1 dimensions. It will 
then be convenient to speak of the linear dependence or independ- 
ence of these points. The geometric meaning of linear dependence 
will be at once evident from the following theorems for the 
case n = 4. 

Two points will here be represented by two sets of four constants 
one Ly Ys 2 Uy 

Voy Yor Zap bys 
which will be linearly dependent when, and only when, they are pro- 
portional, that is, when the points coincide. Hence: 


THEOREM 1. Two points are linearly dependent when, and only 
when, they coincide. 


If we have three points in space, P,, P,, P3, whose codrdinates 
are (2, Yp 2p t), (®, Yor 2a) be) (Lg Yor %g» fy), Tespectively, and 
which are linearly dependent, there must exist three constants ¢, ¢, 
¢3, not all zero, such that 

C124 + Cy Xq + 2%, = 0, 
C1Y1 + Yo + Cs¥g = 9, 
0421 + neo + C323 = 9, 
Cyt, +eyt, +egt, = 0. 
Let us suppose the order of the points to be so taken that ce, «0, and 
solve for 23, Ys) 2g) tg: Pe oe ee 
(1) Ys = yyy + ha Y op 
2, =k, 2, +hy2y 
where k, = — ¢,/¢3, kg = —¢,/¢,. Now if 
Ac+ By+ z+ Dt=0 
is the equation of any plane through the points P, and P,, we have 


Ax, + By, + Cz, + Dt,=9, 
Az, + By, + Cz,+ Di, =9. 
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Multiplying the first of these equations by #,, the second by h,, and 
adding, we have, by means of the equations (1), 


Ax, + By, + Cz, + Dt, = 0. 


Hence every plane through P, and P, passes through P, also, and 
the three points are collinear. 

Now, in order to prove conversely that any three collinear points 
are linearly dependent, let us suppose the three points P,, P,, Ps 
collinear. We may assume that these three points are distinct, as 
otherwise their linear dependence would follow from Theorem 1. 
We have seen that when three points are linearly dependent, the line 
through two of them contains the third. Hence if we let 


Y= Ney Peta, 
Yy! = ky, + keoYoy 
a! = hz, + heey 
U=k,t, + kale 


where k, and k, are two constants, not both zero, the point (2’, y’, 
z', t’) or P' lies on the line P,P,, and our theorem will be established 
if we can show that the constants 4, and k, can be so chosen that 
the points P’ and P, coincide. Now let eax + by + cz+dt=0 be the 
equation of any plane through the point P, but not through P, or P,. 
Thus P, is determined as the intersection of this plane with the line 
P,P,, so that if P’, which we know lies on P,P,, can be made to lie 
in this plane, it must coincide with P, and the proof is complete. 
The condition for P’ to lie in this nian is az! + by' + cz’ +dt'= 0. 
Substituting for a’, y’, 2’, ’ their values given above, we have 


kan, + by, +ez, + dt,) + kar, + by, +02, + dt.) =0. 


But neither of these parentheses is zero, since the plane does not pass 
through P, or P,, hence we may give to k, and &, values different 
from zero for which this equation is satisfied. We have thus proved 


THEOREM 2. Three points are linearly dependent when, and only 
when, they are collinear. 


The proofs of the following theorems are left to be supplied by 
the reader. It will be found that some of them are readily proved 
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from the definition of linear dependence, as above, while for others 


it is more convenient to use the condition for linear dependence ob- 
tained in § 18. » 


THEOREM 3. Four points are linearly dependent when, and only 
when, they are complanar. 


THEOREM 4. Five or more points are always linearly dependent. 

Another geometric application is suggested by the following con- 
siderations: 

A set of n ordinary * quantities is nothing more nor less than a 
complex quantity with n components (ef. § 21).. Our first definition of . 
linear dependence is therefore precisely equivalent to the following: 

The m complex quantities 

Ay, Uns +++ Bn 


are said to be linearly dependent if m ordinary quantities c,, Cy, -- 
not all zero, exist such that: 


C10 + Coll + <> + Cn Gm = 9. 


. Cms 


Now the simplest geometric interpretation for a complex quantity 
with m components is as a vector in space of n dimensions,¢ and we 
are thus led to the conception of linear dependence of vectors. The 
geometric meaning of this linear dependence will be seen from the 
following theorems for the case n = 3: 


THEOREM 5. Two vectors are linearly dependent when, and only 
when, they are collinear. 


THEOREM 6. Three vectors are linearly dependent when, and only 
when, they are complanar. 


THEOREM 7. Four or more vectors are always linearly dependent. 


In order to get a geometric interpretation of the linear dependence 
of polynomials, we must consider, not the polynomials themselves, 
but the equations obtained by equating them to zero. We speak of 
these equations as being linearly dependent if the polynomials are 


* Two different standpoints are here possible according as we understand the term 
ordinary quantity to mean real quantity, or ordinary complex quantity. 

+ There are of course other possible geometric interpretations. ‘Thus in the case 
n= 4 we may regard our complex quantities as quaternions, and consider the meaning 
of linear dependence of two, three, or four quaternions. 
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linearly dependent. If then we regard the independent variables as 
rectangular codrdinates, these equations give us geometric loci in 
space of as many dimensions as there are independent variables. 
Thus, in the cases of two and three variables, we have plane curves 
and surfaces respectively. The case of two loci is of no interest, 
as they must coincide in order to be linearly dependent. In the case 
of three linearly dependent loci it is easily shown that any one must 
meet the other two in all their common points and in no others. 

The following theorems will serve to illustrate the Brome mean- 
ing of linear dependence : 

(1) In the plane : 5 


THEOREM 8. Three circles are linearly dependent when, and only 
when, they belong to the same coaxial family. 


THEOREM 9. Four circles are linearly dependent when, and only 
when, they have a (real or imaginary) common orthogonal circle. 


THEOREM 10. Four circles are linearly dependent when, and only 
when, the points of intersection of the first and second, and the points of 
intersection of the third and fourth, lie on a common cirele. 


THEOREM 11. Five or more circles are always linearly dependent. 


(2) In space (using homogeneous coérdinates) : 


THEOREM 12. Three planes are linearly dependent when, and only 
when, they intersect in a line. 


THEOREM 13. Four planes are linearly dependent when, and only 
when, they intersect in a point. 


THEOREM 14. Five or more planes are always linearly dependent. 


CHAPTER IV 
LINEAR EQUATIONS 


16. Non-homogeneous Linear Equations. In every elementary 
treatment of determinants, however brief, it is explained how to 
solve by determinants a system of n equations of the first degree in 
'n unknowns, provided that the determinant of the coefficients of 
the unknowns is not zero. Cramer’s Rule, by which this is done, 
is this: 


CRAMER’S RULE. (Jf in the equations 


A440 + eee oe Ain Xn, = ky 


Any Vy H+ 20+ + yn Ly, = Its 
the determinant 


M1 "Un 
a= ere 
Any Onn 


is not zero, the equations have one and only one solution, namely : 


a a 
ts cae 


a 
=7, t=, n 


x 
1a 


where a, is the n-rowed determinant obtained from a by replacing the 
elements of the ith column by the elements ky, kg, +++ ky. 

This rule, whose proof we assume to be known,* is of funda- 
mental importance in the general theory of linear equations to 
which we now proceed. 


* The proof as given in most English and American text-books merely establishes 
the fact that if the equations have a solution it is given by Cramer’s formule. That 
these formule really satisfy the equations in all cases is not commonly proved, but 
may be easily estabished by direct substitution. We leave it for the reader to do this. 

43 
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Consider the system of m linear equations in ” variables: 


yy Ly + Va + A ynXy +5, =9, 


An V4 Be Onlin + Om = 0, 


where m and n may be any positive integers. Three cases arise: 
(1) The equations may have no solution, in which case they are 
said to be inconsistent. 
(2) They may have just one solution. 
(3) They may have more than one solution, in which case it will pres- 
ently appear that they necessarily have an infinite number of solutions. 
Let us consider the two matrices: 


Peetu G4 ee 


Amy Ann Amy soe Ann gs 


We will call a the matrix of the system of equations, b the aug- 
mented matrix. 

It is evident that the rank of the matrix a cannot be greater 
than that of the matrix b, since every determinant contained in a 
is also contained in b. We have, then, two cases: 

I. Rank of a= Rank of b. 

II. Rank of a< Rank of b. 

We will consider Case II first. 

Let r be the rank of b. Then b must contain at least one 
r-rowed determinant which is not zero. Moreover, this determinant 
must contain a column of b’s, since otherwise it would be contained 
in a also, which is contrary to our hypothesis. Suppose for definite- 
ness that this non-vanishing 7-rowed determinant is the one situ- 
ated in the upper right-hand corner of b. ‘There is no loss of 
generality in assuming this, since by writing the equations in a dif- 
ferent order and changing the order of the variables 2,, --- x, we 
can always bring the determinant into this position. Now for 
brevity let us represent the polynomials forming the first members 
of our given equations by F,, F,, --- F, respectively, and the 
homogeneous polynomials obtained by omitting the constant terms 
in each of these equations by fj, fy + tm Then we have the 


identities: F=f, +b; (t= Loins Ie 
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Consider the first r of these identities. Since the rank of a is 
less than r, the polynomials f,, f,, --- f, are linearly dependent, 


at; + Gta i +¢F-= 9, 
hence Fy +--+ +¢,f,=¢6,++-+¢,b.= C. 


But since the rank of b is r, the polynomials F,, --- F, are linearly 
independent and therefore (#0. Hence the given equations are in- 
consistent, for if they were consistent all the F’s would be zero for 
some suitably chosen values of z,, --- z,, and if we substitute these 
values in the last written identity we should have 


0=C0+0. 


Let us now consider Case I. Let r be the common rank of a 
and b, then there is at least one 7-rowed determinant in a which 
is not zero. ‘This same determinant also occurs in b. Suppose it to 
be situated in the upper left-hand corner of each matrix. Since all 
(r+ 1)-rowed determinants of either matrix are zero, the first (r+ 1) 
of the F’s are linearly dependent, and we have 


CB Hg hg te FOP, + Cp Peay = 93 


and, since F,, --- F, are linearly independent, ¢,,, cannot be zero ; 
hence we may divide through by it and express ¥,,, linearly in 
‘terms of F,--- F.. The same argument holds if instead of F,,, we 
take F.,, or any other one of the remaining #’s. Hence 


Fj =kY Fy ++ + kOF, (J=1, 2, --»m—r). 


From these identities it is obvious that at any point (2, ---x,) where 
F,, --- F, all vanish, the remaining #”’s also vanish. In other words, 
any solution which the first r equations of the given system may 
have is necessarily a solution of the whole system. 

Now consider the first 7 of the given equations. Assign to 
Lp °° + L, any fixed values 2},,,--+ 2},, and transpose all the terms 
after the rth in each equation to the second member, 


I / 
Wy %y oF Op Lp = — Ay, pp yUp4y — 21° — Un%n — b,, 


= I es — 
ApH Ft HF Up Lp = — Up, yLp44 — ** Ayn Xn Bye 
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Remembering that the right-hand sides of these equations are known 
constants, and that the determinant of the coefficients on the left is 
not zero, we see that we have the case to which Cramer’s Rule 
applies, and that this system of equations has therefore just one 
solution. Hence the given system of equations is consistent, and we 
have the theorem: 


THEOREM 1. A necessary and sufficient condition for a system of 
linear equations to be consistent is that the matrix of the system 
have the same rank as the augmented matrix. 


From the foregoing considerations we have also 


THEOREM 2. Jf in a system of linear equations the matrix of 
the system and the augmented matrix have the same rank r, the values 
of n—r of the unknowns may be assigned at pleasure and the others 
will then be uniquely determined. 

The n—r unknowns whose values may be assigned at pleasure may 
be chosen in any way provided that the matrix of the coefficients of 
the remaining unknowns is of rank r. 


) 


EXERCISES 
Solve completely the following systems of equations: 


2a— y+3z2-— 1=0, 
4a—2y— z+ 8=0, 


1. 
2e-— y—4z24+ 4=0, 
10x —5y—6z24+10=0. 
4x— y+ z2+5=0, 

a 2x—8y+5z2+1=0, 
+ y—22+2=0, 
52 — 2z+2=0. 
2a—d3y+42-— w=3, 

3 x+2y— 24+2w=1, 


38x— y+2z2-—38w=4, 
sa—- y+ z2-Tw=4, 
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17. Homogeneous Linear Equations. We will now consider the 
special case where the equations of the last section are homogeneous, 
z.e. where all the 6’s are zero, 

Ay Uy + + Ay, Ly = 0, 


Any? Ft Fanny = 0. 
The matrices a and b of the last section differ here only by a column 
of zeros ; hence they always have the same rank and this is called 


the rank of the system of equations. Theorems 1 and 2 of the last 
section become 


THEOREM 1. A system of homogeneous linear equations always 
has one or more solutions. 


THEOREM 2. If the rank of a system of homogeneous linear equa- 
tions in n variables is r, the values of n—r of the unknowns may be 
assigned at pleasure and the others will then be uniquely determined.* 

If the rank of the equations is n, there will therefore be only one 
solution, and this solution is obviously 7, =2,=---=2,=0. Since 
the rank can never be greater than n, we have 


THEOREM 3. A necessary and sufficient condition for a system of 
homogeneous linear equations in the n variables (x, --- X,) to have a solu- 
tion other than 2, = % = ++: = % = 0 ts that their rank be less than n. 


CorotuARry 1. Jf there are fewer equations than unknowns, the 
equations always have solutions other than xv, = % = +: = Lp = 0. 


CoroLuARy 2. If the number of equations is equal to the number 
of unknowns, a necessary and sufficient condition for solutions other than 
Ly =U, = +: = x, = 0 ts that the determinant of the coefficients be zero. 

In the special case where the number of equations is just one less 
than the number of unknowns and the equations are linearly 

independent, we will prove the following: 


THEOREM 4. very set of values of %4,-+:%, which satisfies a 
system of n —1 linearly independent,+ homogeneous linear equations in 


* Cf. also the closing lines of Theorem 2, § 16. 
+ The theorem is still true if the equations are linearly dependent, but it is then 
trivial, since the determinants in question are all zero. 
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n unknowns is proportional to the set of (mn —1)-rowed determinants 
taken alternately with plus and minus signs, and obtained by striking 
out from the matrix of the coefficients first the first column, then the 
second, ete. 


Let us denote by a; the (n —1)-rowed determinant obtained by 
striking out the 7th column from the matrix of the equations. Since 
the equations are linearly independent, there must be at least one 
of the determinants aj, a, -+- a Which is not zero. Let it be q;. 
Now assign to 2; any fixed value, ¢, and transpose the zth term of 
each equation to the second member and we have 


yy Vy ee Fy Ue + ty Vi FoF Ayn TH = — Mie, 
An —4, 104 oe 1 Mp4, 5-1 Vi-1 F Ung, 141 Vit Ft TF In—-4,nF%n = — M1, i 
—1)'*-¢-a 
Hence: mee fe (k =1, 2, --- n), 
a; 


v 


from which it is clear that (2, --- x,) are proportional to the de- 
terminants (a1, — dy, dg, --- (— 1)""1a,), as was to be proved. 

The theory of homogeneous linear equations has here been de- 
duced from the theory of linear dependence. It can, however, in 
turn be used to obtain further results in this last-mentioned theory. 
As an example of this we will deduce the: following theorem, which 
we shall find useful later : 


THEOREM 5. Jf a set of points (a, --- x), finite or infinite in 
number, have the property that k points can be found among them upon 
which every other point of the set is linearly dependent, then any k+1 
points of the set will be linearly dependent. 


Let (vj, + w), (af, al), + (cl, .-- cll) be the & points upon 
which every other point of the set is linearly dependent, and let 


(Xj es DAU SR, Son AC cea ea 
be any +1 points of the set. Then we may write 
Xf! = of a, + olf i +4 <Yain + cial, 
(1) . . . ° . . . . . . . G@=1, 2, --k+1) 


Xl = lla! + ofall +... + efile, 
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This is true by hypothesis if (X{!, --. X/) is not one of the first % 
points, and if it is one of these points, it is obviously true. We 
have then to prove that & + 1 constants, C1, Cy +++ Cys, not all zero, 
can be found such that 


Cy Xj¢+ O,XY +--+ Chi, Xf = 0 (J =1, 2, --- n). 


By substituting here the values of the X’s from (1), we see that 
these equations will be fulfilled if 


Cre, - Coe a see a Ce" 5 = 0, 


C1e, + Cyeg +--+ + Chie, = 0, 


and this is a system of fewer equations than unknowns, which is. 
therefore satisfied by a set of C’s not all zero. (Cf. Theorem 8, 
Cor. 1.) 


EXERCISES 
Solve completely the following systems of equations: 


llz+ 8y—224+ 38w=0, 
22+ 83y— z+ 2w=0, 
7zx— yt z— 38w=0, 
4x—lly+5z-12w=0. 


242-— 3y+524+ 8w=0, 
2.,;4¢2- yt z+ w=), 
Ba— 2y4+382+ 4w=0. 


18. Fundamental Systems of Solutions of Homogeneous Linear 
Equations. If (2},---2/,) is a solution of the system of equations 


4,24 a eiehs: a AynXLn = 0, 


(1) 


sit Se) “eee ee eet 1s 


Am V1 Bac Ann®n = 0, 


then (cz},---ez},) is also a solution, and by giving to ¢ different values 
we get thus (except in the special case in which the ws are all zero) 
an infinite number of solutions. These may include all the solutions 
of (1) (cf. Theorem 4 of the last section), but in general this will 
not be the case. 

E 
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Suppose, again, that (vj, --- 2) and (z/, --- 2) are two solutions of 
(1), then (e274 + ¢,2], +++ ¢¢;, + 47) 18 also a solution. If the two 
given solutions are proportional to each other, this clearly gives us 
nothing more than what we had above by starting from a single 
solution ; but if these two solutions are linearly independent, we 
build up from them, by allowing ¢,and ¢, to take on all values, a 
doubly infinite system of solutions; but even this system will 
usually not include all the solutions of (1). Similarly we see 
that, if we can find three linearly independent solutions, we can 
build up from them a triply infinite system of solutions, etc. If, 
proceeding in this way, we succeed in finding a finite number of 
linearly independent solutions in terms of which all solutions can 
be expressed, this finite number of solutions is said to form a 
fundamental system. 


DerrinitIon. Jf (aii, ... cl) (¢=1, 2,---k) are a system of k 
solutions of (1) which satisfy the following two conditions, they are said 
to form a fundamental system: 

(a) They shall be linearly independent. 
(6) Every solution of (1) shall be expressible in the form 


(c,2' +. Coty Sr ctl, 3600re Cit, + Gis 4... +¢,04), 


THEOREM 1. Jf the equations (1) are of rank r <n, they possess 
an infinite number of fundamental systems each of which consists 
of n—r solutions. 


Suppose the r-rowed determinant which stands in the upper left- 
hand corner of the matrix of the equations (1) doés not vanish, and 
let us consider the first r of these equations. Any solution of these 
will be a solution of all the others. Transpose all terms after the 
rth to the second members, and let (2,,,, --- 2) have any fixed set 
of values (2}.,,, +++ 2), not all zero; then these r equations will have 
just one solution given by Cramer’s Rule. Call it (ay oe 
Now let (#,44, «++ %) have any other fixed set of values (Spas (ey 
not all zero, and we get another solution, (2, ... 2!’). Continue in 
this way until we have n—r solutions 


G eee G 
Uy, 8** Loy Uryyy 28* Any 


afr... oot, .. afta, 
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If we have chosen these n — r sets of values for (2,4, +++ %) so that 
the determinant 


(2) 


I / 
Cy 44 see Ly 


ans eat gin-r] 


is not zero, —and this may clearly be done in an infinite variety of 
ways, — these n —r solutions will be linearly independent. That is 
to say, we may thus obtain an infinite number of sets of n—,r solu- 
tions each, each of which satisfies condition (a) of our definition for 
a fundamental system. 

To prove that these sets of solutions also satisfy condition (6), let 
us suppose that (Xj, --- X,) is any solution of the r equations we are 
considering. The last n—vr of these X’s are linearly dependent on 
the n —r sets of values we have chosen for (,,,, --- Z,) since we have 
here more sets of constants than there are elements in each set (cf. 
Theorem 2, §13), and the determinant (2) is not zero. Thus 


(83) X,=e¢2)+ e27]+---+¢,_,0-71 (@=r+1, r+2, --- n). 


Let us now solve the first 7 equations (1) by Cramer’s Rule, regard- 
ing Z44, +++ Z, as known. We thus get results of the form 


vj = Aj py + APE psy 35 aad =f= Apy,, G a it: eee 1). 
By assigning special values here to 2,,,, ++ Zp, we get 
wa Alo. + Afra $+ Aln-rly 


(4). (721,27) 
rnd Poe orl 4. Ala no .. Lb Alp—rgln-a, 


X= a Aj X44 £3 Aj X49 Ga a AY Xs 


_ If we multiply the first n —7 of these equations by ¢, ++ ¢,_, re- | 
spectively and add, we get, by (3), 
Crp + Cy ar = ALK be + APL, 
Consequently, by the last equation (4), 
(5) Ky = 60, + ++ Cy (Fel, 2,.--7), 
Equations (8) and (5) together prove our theorem. 


52 INTRODUCTION TO HIGHER ALGEBRA 


We thus see that the totality of all solutions of the system (1) 
forms a set of points satisfying the conditions of Theorem 5, §17. 
Consequently, 


TurorEeM 2. Jf the rank of a system of homogeneous linear equa- 
tions in n variables is r, then any n—r+1 solutions are linearly 
dependent. 


Finally we will prove the theorem: 


THEOREM 8. A necessary and sufficient condition that a set of 
solutions of a system of homogeneous linear equations of rank r in n 
variables form a fundamental system is that they be 


(a) linearly independent, 

(6) n—r in number. 

By definition, (a) is a necessary condition. ‘To see that (0) also 
is necessary, notice that by Theorem 2 there cannot be more 
than n—vr linearly independent solutions. We have, then, 
merely to show that 7 linearly independent solutions never form 
a fundamental system when J<n—vr. If they did, then by 
Theorem 5, $17, any set of 7+ 1 solutions would be linearly 
dependent, and therefore the same would be true of any set of 
n —r solutions (since n—r21+1). But by Theorem 1, this 
is not true. 

In order now to prove that conditions (a) and (4) are also 


sufficient, let 
(al, a see all) (2 ss 1, 2. r) 


be any system of n — r linearly independent solutions of our system 
of equations, and let (2, ... 2) be any solution of the system. 
Then, by Theorem 2, we have n —r+1 constants (Gq, ... Cp_-+1)s 
not all zero, and such that 


0405 Coty! vee Cy OT Cyn 1 Bj =O G=1, 2, 2 


But since the n — r given points are linearly independent, ¢,_,.4., # 05 
accordingly these last equations enable us to express the solu- 
tion (a, ... @,) linearly in terms of the m — r given solutions, 
and this shows that these » — 7 solutions form a fundamental 
system. 
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EXERCISES 


1. Prove that all the fundamental systems of solutions of a system of homo- 
geneous linear equations are included in the infinite number obtained in the 
proof of Theorem 1. 


2. Given three planes in space by their equations in homogeneous coérdinates. 
What are their relative positions when the rank of the system of equations is 3? 
when it is 2? when it is 1? 


3. Given three planes in space by their equations in non-homogeneous coérdi- 
nates. What are their relative positions for the different possible pairs of values 
of the ranks of the matrices and augmented matrices ? 


CHAPTER V 
SOME THEOREMS CONCERNING THE RANK OF A MATRIX 


19. General Matrices. In order to show that a given matrix 
is of rank 7, we have first to show that at least one r-rowed deter- 
minant of the matrix is not zero, and secondly that all (7 + 1)- 
rowed determinants are zero. This latter work may be considerably 
shortened by the following theorem : 


TurorEM 1. Jf in a given matrix a certain r-rowed determinant 
is not zero, and all the (r+1)-rowed determinants of which this r-rowed 
determinant is a first minor are zero, then all the (r+1)-rowed deter- 
minants of the matrix are zero. 

We will assume, as we may do without loss of generality, that 
the non-vanishing r-rowed determinant stands in the upper left- 
hand corner of the matrix. Let the matrix be 


411° Cn 


Any °°’ Umn 


and consider the r+ 1 sets of » quantities each which lie in the first 
r +1 rows of this matrix. These r+ 1 sets of quantities are linearly 
dependent, as will be seen by reference to the proof of Theorem 1, 
§18, for although we knew there that all the (7+1)-rowed deter- , 
minants were zero, we made use of this fact only for those (7 +1)- 
rowed determinants which we now assume to be zero. Moreover, 
since the 7 sets of constants which stand in the first 7 rows of our 
matrix are linearly independent, it follows that the (r+ 1)th row is 
linearly dependent on the first r. Precisely the same reasoning 
shows that each of the subsequent rows is linearly dependent on the 
first rrows. Accordingly, by Theorem 5, §17, any r+1 rows are 
linearly dependent ; and therefore, by Theorem 1, § 13, all the (r+1)- 
rowed determinants of our matrix are zero, as was to be proved. 
: 54 
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Still another method of facilitating the determination of the rank 
of a matrix is by changing the form of the matrix in certain ways 
which do not change its rank. In order to explain this method, we 
begin by laying down the following definition : 


DEFINITION 1. By an elementary transformation of a matrix we 
understand a transformation of any one of the following forms : 

(a) the interchange of two rows or of two columns ; 

(6) the multiplication of each element of a row (or column) by the 
same constant not zero ; 

(e) the addition to the elements of one row (or column) of the prod- 
ucts of the corresponding elements of another row (or column) by one and 
the same constant. 

It is clear that if we can pass from a matrix a to a matrix b by one 
of these transformations, we can pass back from b to a by an elemen- 
tary transformation. 


DEFINITION 2. Two matrices are said to be equivalent tf it is possi- 
ble to pass from one to the other by a finite number of elementary trans- 
formations. 


THEOREM 2. If two matrices are equivalent, they have the same rank. 


It is evident that the transformations (a) and (6) of Definition 1 
do not change the rank of a matrix, since they do not affect the van- 
ishing or non-vanishing of any determinant of the matrix. In order 
to prove our theorem, it is therefore sufficient to prove that the rank 
of a matrix is not changed by a transformation (ce). 

Suppose this transformation consists in adding to the elements of 
the pth row of a matrix a & times the elements of the gth row, 
thus giving the matrix b. Let r be the rank of the matrix a. We 
will first show that this rank cannot be increased by the transforma- 
tion, that is, that all (r+ 1)-rowed determinants of the matrix b are 
zero. By hypothesis all the (r +1)-rowed determinants of the 
matrix a are zero, and some of these determinants are clearly not 
changed by the transformation, namely, those which do not contain 
the pth row, or which contain both the pth and the gth row. The 
other determinants, which contain the pth row but not the gth, take 
on after the transformation the form A+B where A and B are 
(r+1)-rowed determinants of a, and are therefore zero. Thus we 
see that the transformation (c) never increases the rank of a matrix. 
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Moreover, the rank of b cannot be less than that of a, for then the 
transformation (¢) which carries b into a would increase the rank of 
b, and this we have just seen is impossible. 

This theorem can often be used to advantage in determining the 
rank of a matrix, for by means of elementary transformations it is 
often easy to simplify the matrix very materially. 


EXERCISES 


Determine the ranks of the following matrices: 


1. 1A IS). G6 Same? 
6° 104 21 17 
. 7 Bs. coo 8 4 eel 
25. 20. wi5_ 20° 9s 
2. 15 0. J 116 = 39 0 
171 <= 69 402 123 45 
301 0 87 417 = 169 


114 — 46 268 82 30 


3. Prove that any matrix of rank r can be reduced by means of elementary 
transformations to a form where the element in the ith row and ith column is 1 
when i<r, while all the other elements of the matrix are zero. 


4. Hence prove that two matrices with m rows and n columns each are always 
equivalent when they have the same rank. 


5. Prove that a necessary and sufficient condition that the matrix 
Air *** Ay 


Ami *** Amn 
be of rank 0 or 1 is that there exist m + n constants ai, +++ an, 81, +++ By, such+that 
ay = 04 Bj. 
20. Symmetrical Matrices. 
DEFINITION. The square matrix 


yy AQ >> Ayn 
G1 19 °** Fon 
a= 


Any Ang Ann, 


(and also its determinant) is said to be symmetrical if the pairs of terms 
which are situated symmetrically with respect to the principal diagonal 
are equal. That is, tf a, = a;;. 


a 
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We will denote by M, an t-rowed principal minor of a. It is 
our main object in this section to show how the rank of the symmet- 
rical matrix may be determined by an examination of the principal 
minors only. ‘This may be done by means of the following three 
theorems. 


THEOREM 1. Jf an r-rowed principal minor M, of the symmetrical 
matriz ais not zero, while all the principal minors obtained by adding 
one row and the same column, and also all those obtained by adding two 
rows and the same two columns, to M, are zero, then the rank of a is r. 


Let the non-vanishing minor be the one which stands in the upper 
left-hand corner of a, and let B,, denote the determinant obtained by 
adding the «th row and the Sth column to M,. If we can show that 
B,,= for all unequal values of « and 8 our theorem will be proved. 
Cf. Theorem1,§19. Give to the integers « and 8 any two unequal 
values, and let C’ denote the determinant obtained by adding to M/, the 
ath and Sth rows and the ath and Sth columns ofa. Then we have, 
by hypothesis, M+ 0, B,,=9, Bsz=9, C=0. Let Mz be the two- 
rowed principal minor of the adjoint of C which corresponds to the 
complement of M,in C. Then by Corollary 3, § 11, we have 


M.= CM, =0. 
But My =B,,P pp — Bie: 
Therefore B= 9. 


THEOREM 2. Jf all the (r+1)-rowed principal minors of the sym- 
metrical matrix a are zero, and also all the (r+2)-rowed principal 
minors, then the rank of ais r or less. 

If r=0, all the elements in the principal diagonal are zero and all 
the two-rowed principal minors are zero. 


“| ites 
That 18, Ay . Aj; —ih gq 0, 


and therefore, since a,=a,=0, a,;=0. That is, every element is 
zero and hence the rank is zero, and the theorem is true in this 
- special case. 

Now, assume it true when r=; that is, we assume that when all 
(z+ 1)-rowed principal minors are zero and all (k+2)-rowed principal 
minors are zero, the rank of a isless than k+1. Then it follows that 
when all (4 +2)-rowed, and all (4+ 3)-rowed principal minors are zero, 
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the rank of a is less than +2. For in this case, if all (4 + 1)-rowed 
principal minors are zero, the rank is less than &+1, by hypothesis, 
and if some (£+1)-rowed principal minor is not zero, the rank is ex- 
actly k +1, by the last theorem. We see then that if the theorem is 
true for r=hk itis truefor7=k+1. But we have proved it true for 
r = 0,hence it is true for all values of ». 

THEOREM 8. If the rank of the symmetrical matrix a is r >0, there 
is at least one r-rowed principal minor of a which is not zero. 

For all (7 +1)-rowed principal minors are zero, and, if all r-rowed 
principal minors were zero also, the rank of a would be r—1 or less, 
by the last theorem. 

We close with a theorem of a somewhat special character which 
will be found useful later (cf. Exercises 4-6, § 50). 

THEOREM 4. Jf the rank of the symmetrical matrix ais r>0, we 
may shift the rows (at the same time shifting the columns in the same 
way, thus keeping a symmetrical) in such a way that no consecutive two 


of the set of quantities M,, My My... M, 


shall be zero and M,#+0; M, being unity, and the other M’s being the 
principal minors of a of orders indicated by their subscripts, which stand 
in the upper left-hand corner of a after the shifting. 

By definition we have M,+0. Leaving aside for the moment 
the special case in which all the elements of the principal diagonal are 
zero, let us suppose the element a, is not zero. Then by shifting the 
ith row and column to the first place, we have M,+0. We have 
thus fixed the first row and column, but we are still at liberty to 
shift all the others. Now consider the two-rowed principal minor 
obtained by adding to M, one row and the same column. Leaving 
aside still the special case in which these are all zero, let us suppose 
that the two-rowed determinant obtained by striking out all the rows 
and columns except those numbered 1 and 2, is not zero. Then, by 
shifting the 7th row and column into the second place, we have 
M,#+9. We next have to consider the three-rowed principal minors 
of which M, isa first minor. We can evidently proceed in this way 
until we have so shifted our rows and columns that none of the quan- 
tities M,, M,, ... M, are zero, unless at a certain stage we find that 
all the principal minors of a certain order which we have to consider 
are zero. In this case we should have so shifted our first k rows and 
columns that none of the quantities WM, Mj, ... M;, are zero, but we 
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should then find that all (4+1)-rowed principal minors of which 1, 
is a first minor vanish, so that, however we may shift the last n—k 
rows and columnsy we have M,,,=0. Let us then examine the 
(£+2)-rowed principal minors of which M, is a second minor.* 
These can (by Theorem 1) not all be zero as otherwise the rank of a 
would be&<r. That is, if M,,=0, we can so arrange the rows 
and columns that M@,,,#0. Thus we see that the rows and col- 
umns of a may be so shifted that no consecutive two of the W’s are 
zero. Now, if M,_,=0, the above proof shows that we can make 
M,+0. But even though M,_,+0 we can still make /,+0, for by 
hypothesis ¢ all the determinants obtained by adding to M,_, two 
rows and the same two columns vanish, and if all those obtained by 
adding one row and the same column were zero also, the rank of a 
would be r—1, by Theorem 1. 

A symmetrical matrix is said to be arranged in normal form when 
no consecutive two of the M’s of Theorem 4 are zero and M+ 0. 








_ EXERCISES 
1. Determine the ranks of the following matrices: 
2 p aes 6 2 O24 5107 1 
1 0 4 1), 4 Sse Anis 
ii 4 56 5 10 18 40 17 
2 -1 5 —6 aly 6S AO 8} 
1 0 0 Ligne 0 1 b d 
0 1 0 2 5 1 um we e 
0 0 1 3 Gill b c 2be cd+be 
1 2 38 14 32 d e cd+be 2de 
4 5 eR diy 


2. By a skew-symmetric determinant, or matrix, is meant one in which ay= 
— aj; (and therefore aii = 0). 

Establish for such matrices theorems similar to Theorems 1, 2, 3 of this section. 

3. By considering the effect of changing rows into columns, prove that a skew- 
symmetric determinant of odd order is always zero. 

4. Prove that the rank of a skew-symmetric matrix is always even. 


* The tacit assumption is here made that when k=r—1, r<n, as otherwise My+9 
would have no meaning. The case r=n can, however, obviously not occur here, for 
then we should have M@41=a 40. : 

+ Here again we assume that r<n, for if r=n, M,=a+0. 


CHAPTER VI 


LINEAR TRANSFORMATIONS AND THE COMBINATION 
OF MATRICES 


21. Matrices as Complex Quantities. We have said in $7 that a 
matrix of m rows and ” columns is not a quantity, but a set of mn 
quantities. This statement is true only if we restrict the term 
quantity to the real and complex quantities of ordinary algebra. A 
moment’s reflection, however, will show that the conception of quan- 
tity as used in arithmetic and algebra has been gradually enlarged 
from the primitive conception of the positive integer by using the 
word quantity to denote entities which, at an earlier stage, would 
not have been regarded as quantities at all, as, for instance, nega- 
tive quantities. We will consider here only one of these extensions, 
namely the introduction of complex quantities, as this will lead us to 
look at our matrices from a broader point of view. 

If we have objects of two or more different kinds which can be 
counted or measured, and if we consider aggregates of such objects, 
we get concrete examples of complex quantities, as, for instance, 
5 horses, 3 cows, and 7 sheep. A convenient way to write such a 
complex quantity is (5, 8, 7), it being agreed that, in the illustra- 
tion we are considering, the first place shall always indicate horses, 
the second cows, and the third sheep. In the abstract theory of 
complex quantities we do not specify any concrete objects such as 
horses, cows, etc., but merely consider sets of quantities (couples, 
triplets, etc.), distinguishing these quantities by the position they 
occupy in our symbol. Such a complex quantity we often find it 
convenient to designate by a single letter, 


“a= (a, b, c) 


just as in ordinary algebra we denote a fraction (2 for instance), 

which really involves two numbers, by a single letter. We speak 

here of the simple quantities a, 6, c of which « is composed as its first, 
60 
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second, third components ; and we call two complex quantities equal 
when and only when the components of one are equal respectively to 
the corresponding components of the other. Similarly a complex quan- 
tity is said to vanish when and only when all of its components are zero. 
What makes it worth while to speak of such sets of quantities as 
complex quantities is that it is found useful to perform certain alge- 
braic operations on them. By the sum and difference of two complex 
ee Oy = (Ay, Oy C4), Oty = (ay bay 2) 
we mean the two new complex quantities 
ay + Oy = (a, = Q25 b, + 65. Cy +2), Oy — 6 = (a, — Ag, b, — Oo. Cy = Gp): = 


When it comes to the question of defining what we shall under- 
stand by the product of two complex quantities, things are by no 
means so simple. It is necessary here to lay down some rule accord- 
ing to which, when two complex quantities are given, a third, which 
we call their product, is determined. Such rules may be laid down 
in an infinite variety of ways, and each such rule gives us a different 
system of complex quantities. f 

We come now to the subject of matrices. A matrix of m rows 
and m columns being merely a set of mn quantities (which we 
- assume to be either real quantities or the ordinary complex quantities 
of elementary algebra) arranged in a definite order, is, according to 
the point of view we have explained, a complex quantity with mn 
components; and it is only a special application of the theory of 
complex quantities which we have sketched, when we lay down the 
following definitions: 


DEFINITION 1. A matrix is said to be zero when and only when all 
of its elements are zero. 


DEFINITION 2. Two matrices are said to be equal when and only 
when they have the same number of rows and of columns, and every 
element of one is equal to the corresponding element of the other. 


* That this is the natural meaning to be attached to the terms swm and difference 
will be seen by reference to the concrete illustration given above. 

+ If, in particular, we wish to introduce the ordinary system of complex quanti- 
ties of elementary algebra, we use a system of couples, and define the product of two 


couples, = (4, 01), G2 = (a2, 52), 
by the formula 0t1 0g = (402 — byb2, ibe + a2b1). 
For further details cf. Burkhardt’s Funktionentheorie, §§ 2, 3. 
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DEFINITION 3. By the sum (or difference) of two matrices of m 
rows and n columns each, we understand a matrix of m rows and n col- 
umns, each of whose elements is the sum (or difference) of the corre- 
sponding elements of the given matrices. 

In order to distinguish them from matrices, we will call the 
ordinary quantities of algebra (real quantities and ordinary complex 
quantities) scalars. 

Before proceeding, as we shall do in the next section, to the 
definition of the product of two matrices, we will define the product 
of a matrix and a scalar. 


DEFINITION 4. Ifa is a matrix * and k a scalar, then by the prod- 
uct ka or ak we understand the matrix each of whose elements is k 
times the corresponding element of a. 


As an obvious consequence of our definitions we state the 
theorem: 


THEOREM. All the laws of ordinary algebra hold for the addition 
or subtraction of matrices and their multiplication by scalars. 


For instance, if a, b, c are matrices, and k, 7 scalars, 
a+b=b+a, 
a+(b+c)=(a+b)+¢, 
ka+kb=k(a+b), 
ka+la=(k+ la. 


EXERCISE 


If rj and rz are the ranks of two matrices and R the rank of their sum, prove 


that 
- RSr,+7,. 


22. The Multiplication of Matrices. Up to this point we have con- 
sidered matrices with m rows and n columns. For the sake of sim- 
plicity of statement, we shall confine our attention from now on to 
square matrices, that is to the case m=n. This involves no real loss 


* The notation here used, matrices being denoted by heavy-faced type, will be 
systematically followed in this book. 
+ We add that, as a matter of notation, we shall write 


(—1) a=~—a. 
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of generality provided we agree to consider a matrix of m rows and 
nm columns, where m+n, as equivalent to a square matrix of order 
equal to the larger of the two integers m, m and obtained from the 
given matrix by filling in the lacking rows or columns with zeros. 

The question now presents itself: How shall we define the prod- 
uct of two square matrices of the same order? It must be clearly 
understood that we are logically free to lay down here such definition 
as we please, and that the definition we select is preferable to others 
not on any @ priort grounds, but only because it turns out to be more 
useful. We select the following definition, which is suggested * by 
the multiplication theorem for determinants: 


DEFINITION 1. The product ab of two square matrices of the nth 
order is a square matrix of the nth order in which the element which lies 
in the ith row and jth column is obtained by multiplying each element of 
the ith row of a by the corresponding element of the jth column of b and 
adding the results. 


Let us denote by a, and 6, the elements in the zth row and jth 
column of a and b respectively, or, as we will say for brevity, the 
element (7,7) of these matrices. Then, according to our definition, 
the element (7, 7) of the product ab is 


(1) 61015 + Aigdgs + +++ + GinPngs 


while the element (¢, 7) in the matrix ba is 
(2) 5D jy + Og dig + +++ + Oygdin: 

Since the two quantities (1) and (2) are not in general equal, we 
obtain 


THeorEM 1. The multiplication of matrices is not in general com- 


mutative, that is, in general ab + ba. 


Let us now consider a third matrix c whose element (¢,7) is ¢;, 
and form the product (ab)c. The element (¢, 7) of this matrix is 
(yb yy + Gigd oy + 20 + Gin Ong Ory 
F(ab yp + ighyg + +++ + Ginbng)Oo3 
(3) Su 
Er Cabs af ee a ar Pt 


* Historically this definition was suggested to Cayley by the consideration of the 
composition of linear transformations; cf. § 23. 


64 INTRODUCTION TO HIGHER ALGEBRA 


On the other hand, the element (7, 7’) of the matrix a(bc) is 
Oi (Oy e15 + ae + + Binlng) 


(4) T af Portas 4 ar Paatas + oa Ponte i) 
+. 
ae ne oF <Thew, “hong or ae 


Since the two quantities (3) and (4) are equal, we have established 

THEOREM 2. The multiplication of matrices is associative, that is, 
(ab)c = a(bc). 

Finally, since the element (¢, 7) of the matrix a(b +c) isclearly 


equal to the sum of the elements (¢, 7) of the matrices ab and ac, we 
have the result 


TurorEeM 3. The multiplication of matrices is distributive, that is, 
a(b-+c)=ab+ac. 


Besides the commutative, associative, and distributive laws, there 
is one other principle of elementary algebra which is of constant use, 
namely, the principle that a product cannot vanish unless at least one 
of the factors is zero. Simple examples show that this is not true in 
the algebra of matrices. We have, for instance, 


Gapedas U Oa 0a) MO eOs 
(5) GM, G%, 0-10 0 0 | =10 0 O;=9, 
As, Azo O bs, sq Ogg 0 0 9 


whatever the values of the a’s and 6’s may be. Hence 


THEOREM 4. From the vanishing of the product of two or more 
matrices, we cannot infer that one of the factors is zero. 

The process of cancelling out non-vanishing factors which enter 
throughout an equation will, therefore, be inadmissible in the algebra 
of matrices. 

We next state a result which follows at once from the similarity 
between the theorem for the multiplication of determinants and our 
definition of the product of two matrices : 


THeorEM 5. The determinant of a matrix which is obtained by 


multiplying together two or more matrices is equal to the product of the 
determinants of these matrices. 
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The conception of the conjugate of a matrix, as defined in § 7, 
Definition 2, is an important one, and the following theorem concern- 
ing it is often useful: 


THEOREM 6. The conjugate of the product of any number of 
matrices is the product of their conjugates taken in the reverse order. 


In order to prove this theorem we first notice that its truth in the 
case of two matrices follows at once from the definition of the prod- 
uct of two matrices. Its truth will therefore follow in all cases if, 
assuming the theorem to be true for the product of n—1 matrices, 
we can prove that it is true for the product of n matrices. Let us 


write 
b= aay --- ay: 


Then, from what we have assumed, 


lame veatll i/ ii 
b'= a, --- a3 as, 


where we use accents to denote conjugates. Accordingly, 
(aja, ~~ a,)! = (ab) = dla =al, --- ala, 


and our theorem is proved. 
In conclusion we lay down the following: 


DEFINITION 2. <A square matrix is said to be singular if its deter- 
minant is zero. 


According to the convention made at the beginning of this sec- 
tion, it will be seen that all matrices which are not square are 
singular. 


EXERCISES 


1. DerinitTion. A matriz a is called a divisor of zero if a matrix b different 
Srom zero exists such that either ab = 0 or ba = 0. 

Prove that every matrix one of whose rows or columns is composed wholly of 
zeros is a divisor of zero. 

2. If it is possible to pass from a to b by means of an elementary transforma- 
tion (cf. § 19, Definition 1), prove that there either exists a non-singular matrix c 
such that 

ace p> 
or a non-singular matrix d such that 
da = b. 
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3. If all the elements of a matrix are real, and if the product of this matrix 
and its conjugate is zero, prove that the matrix itself is zero. 


4. If the corresponding elements of two matrices a and b are conjugate imagi- 
naries, and, b’ being the matrix conjugate to b, if 


ab’=0, thena=b=0. 


23. Linear Transformation. Before going farther with the 
theory of matrices we will take up, in this section and the next, the 
closely allied subject of linear transformation, which may be regarded 
as one of the most important applications of the theory of matrices. 

In algebra and analysis we frequently have occasion to introduce, 
in place of the unknowns, or variables, we had originally to deal 
with, certain functions of these quantities which we regard as new 
unknowns or variables. Such a transformation, or change of vari- 
ables, is particularly simple, and for many purposes particularly 
important, if the functions in question are homogeneous linear poly- 
nomials. It is then called a homogeneous linear transformation, or, 
as we shall say for brevity, simply a linear transformation. Hi x,, --- x, 
are the original variables, and 2}, --. 2/, the new ones, we have, as the 
formule for the transformation, 


je 
Ly = Ay Vy e+ $y Lny 


i 
Ly = Any®y + ++ + AnnLpys 


The square matrix dy" Cap 


Any *** Ann 


is called the matrix of the transformation, and the determinant of 
this matrix, which we will represent by a, is called the determinant 
of the transformation. Inasmuch as the transformation is com- 
pletely determined by its matrix, no confusion will arise if we speak 
of the transformation a. 

In most cases where we have occasion to use a transformation itis 
important for us to be able, in the course of our work, to pass back to 
the original variables, and for this purpose it must be possible, not 
merely to express aj, --- x, asfunctions of 2,, --- 2, but also to express 


@, +++ 2, as functions of 2, --- a). In the case of linear transforma- 
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tions this can in general be done. For the equations of the transfor- 
mation may be regarded as non-homogeneous linear equations in 
Ly, °++ X, and if the determinant a of the transformation is not Zero, 
they can be solved and give 


A A 
Be a Oe 


a a 
where A,,, --- A,,, are the cofactors of 4), +++ Gy, in a. 
This transformation A is called the enverse of the transformation 
a, but it must be remembered that it exists only ifa+0. A linear 
transformation for which a=0 is called a singular transformation. 
If a is non-singular, its inverse A is also non-singular, since the deter- 


minant of A is a1 (cf. Corollary 2, § 11). 


DEFINITION. The special linear transformation 


El =; ae SS IR ae =n 
whose matrix is 1° 0 =. 0 
01 -- QO 
I = 9 
iat), vert OL 


is called the identical transformation. 


The determinant of this transformation is 1. 

We turn now to the subject of the composition of linear trans- 
formations. If we introduce a new set of variables 2’ as functions 
of the original variables z, and then make a second transformation 
by introducing a third set of variables w'’ as functions of the vari- 
ables x’, these two transformations can obviously be combined and 
the variables z!’ expressed directly in terms of the 2’s. If the two 
transformations which we combine are linear transformations, it is 
readily seen that the resulting transformation will also be linear. 
The precise formule are important here, and for the sake of simplic- 
ity we will write them in the case of three variables, a case which 
will be seen to be perfectly typical of the general case. 
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Let 
he = i pore if | "1 
Dy =H AyyLy + AyyVy + AygV3y ay = 442} + byte + O43, 
/ 1b pees | fi / 
A) Ly = Agi Vy + Agg% + Ag3V3, Dy ay = by%4 + Og9% + bo5%3, 
| ieee | / / 
Uz = AniXy + Age%y + Mgg%ay 3 = by 2 + Og9%9 + dggrr, 
be two linea »sformations. Replacing the z’’s in b by their 


values from a, we 


(all = (44044 + Mgyby9 + A315 13) 24 
+ (442041 + 9019 + 39513) Xo 
+ (443041 + A93b4 + 433043) 23» 


© = (A445 91 + Ay Pag + Ag1593)%y 
+ (@49b91 + Cagbo9 + A39593)%e 
+ (443591 + ogbo9 + 433003) Xa» 


Wg = (Ayyb31 + Aq1P 39 + Ag10g3)%y 
+ (44031 + A909 + Agq033) Xe 
+ (443031 + Aagbgy + 233033) Xe: 


It will be seen that the matrix of this transformation is ba. 
Hence, 


THEorEM. Jf we pass from the variables x to the variables x! by a 
linear transformation of matrix a, and from the variables x! to the vart- 
ables x'' by another linear transformation of matrix b, then the linear 
transformation of matrix ba will carry us directly from the variables x 
to the variables x!'.* 


24. Collineation. We come now to an important geometrical 
application of the subject of linear transformation. For the sake of 
simplicity we begin with the case of three variables, which we will 
regard as the homogeneous codrdinates of points in a plane. 


The equations w! = aye + diy + et, 


(1) y' = aye + bay + eat, 
t = age + bey + eat 
* This result may be remembered conveniently by means of the following symbolic 
notation, which is often convenient. Let us denote the transformation a by the 


symbolic equation #! = a(x), and the transformation b by w/’/=b(#!). The result of 
combining these two transformations is then «#// = b(a(#)) or simply «!! = ba(@). 
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may be regarded as defining a transformation of the points of the 
plane; that is, if (#, y, t) is an arbitrarily given point, we can com- 
pute, by means of (1), the codrdinates (a’', y’, t') of a second point 
into which we regard the first point as being transformed. The 
only exception is when the computed values of 2’, y', t are all three 
zero, in which case there is no point into which the given point is 
transformed. This exceptional case can clearly occur only when the 
determinant of the transformation (1) is ze o. Let us then confine 
our attention to non-singular linear transformations. In this case, 
not only does every point (2, y, ¢) correspond to a definite point 
(2’, y', tv’), but conversely, every point (z', y’, t’) corresponds to a 
definite point (2, y, t), since the transformation (1) now has an inverse 


A, / ve / A, / 
=— — 1 cape aaa) t 

a8 £ + y + c 
B 


D 
peter Dgyy 
— t 
thes, 9 


C. Gy 
ee 2G a Seg 
a aie 


(2) w+ 


om Ss 
i i 
BIR &|h & 


where D is the determinant of (1), and A, B, OC; are the cofactors 
-in D. 
The points (2, y, ¢) of the line 


(8) ax + By+yt =0 


are transformed by means of the non-singular transformation (1) 
into points of another line, 


(4) Erie amiay, if Jonah nally 4 CAs poet yoy 0, 





as we see by using formule (2). Conversely every point of the line 
(4) corresponds, as we see by using (1), to a point on (3). That is, 
the transformation establishes a one-to-one correspondence between 
the points on the two lines (8) and (4), or, as we say, it transforms 
the line (3) into the line (4). On account of this property of trans- 
forming straight lines into straight lines, the transformation is called 
a collineation. The transformation is also known as a projective 
transformation, for it may be shown that it can be effected by pro- 
jecting one plane on to another by means of straight lines radiating 
from a point in space. 
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What we have here said in the case of two dimensions applies 
with no essential change to three dimensions. The transformation 


a! =aye+ by +c24+ dyt, 
(5) y' = age + boy + C2 + dot, 
2! = age + bey + cg2 + dt, 
th =aga+by+cze+ dyt 


gives us, provided its determinant is not zero, a one-to-one trans- 
formation of the points of space, which carries over planes into 
planes, and therefore also straight lines into straight lines, and is 
called a collineation or projective transformation of space. The same 
idea can be extended to spaces of higher dimensions. 

Quite as important is the case of one dimension. The transfor- 
mation a'= a,x + b,t, 
O) =a + bot 


gives us, provided its determinant is not zero, a one-to-one trans- 
formation of the points on a line. This we call a projective trans- 
formation of the line, the term collineation being in this case 
obviously inadequate. 

It is possible, although for most purposes not desirable, to express 
the projective transformations (6), (1), (5) in one, two, and three 
dimensions in terms of non-homogeneous, instead of homogeneous 
coérdinates. We thus get the formule 








1  44X+6, ya OX th Vt gtd 

eh tei a,X +b, aX +b,V¥+e,Z+ dy 
x uk +b,%+y (9) Jyre aX +b,V+4+eo,4+ dy 
asX + b,Y + Cy : a,X+b,Y+¢,Z4+4+ d, 


8 
(8) yi OX th Vte gi UX + ba + 60 + dy 


GX + 0, ees ~ aX +b,V+e,0+d, 








These fractional forms may, in particular, be used to advantage 
in case their denominators reduce to mere constants. This special 
case, which is known as an affine transformation, may clearly be char- 
acterized by saying that all finite points go into finite points.* 

* Tf we consider the still more special case in which the constant terms in the 


numerators of (8) and (9) are zero, that is, affine transformations in which the origin 
is transformed into itself, we see that our formule (8) and (9) have the form (6) and 
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These affine transformations are of much importance in mechanics, 
where they are known as homogeneous strains; cf., for instance, 
Webster’s Dynamics (Leipzig, Teubner), pp. 427-444. 

Although we propose to leave the detailed discussion of singular 
transformations to the reader (see Exercise 1 at the end of this sec- 
tion), we will give one theorem concerning them. 


THEOREM 1. (Jf the points P,, P,, +: are carried over by a singu- 
lar projective transformation into the points P|, P), ---, then, if our 
transformation is in one dimension, the points P!' will all coinetde ; 
uf in two dimensions, they will all be collinear ; tf in three dimensions, 
they will all be complanar, ete. 


Suppose, for instance, that we have to deal. with two dimensions. 
Since the determinant of the collineation (1) is supposed to be zero, 
the three polynomials in the second members of (1) are linearly de- 
pendent; that is, there exist three constants, £,, k,, k,, not all zero, 
and such that for all values of x, y, t, 


(10) kw! + Igy! + Iegt! = 0. 


Accordingly all points (a’, y’, t’) obtained by this transformation 
lie on the line (10). 

Similar proofs apply to the cases of one dimension and of three 
or more dimensions. 


THEOREM 2. Any three distinct points on a line may be carried 
over respectively into any three distinct points on the line by one, and 
only one, projective transformation. 


Let the three initial points-be P,, P,, P;, with homogeneous codr- 
dinates (2, t,), (2 t,), (», t3) respectively, and let the points into 
which we wish them transformed be P\, P), P 5 with codrdinates 
(x\, t1), (xy th), (v5, ty’). The projective transformation 


a! = ax + Bt, 
t=ya+ ot 


(1) respectively. Thus (6) may be regarded either as the general projective transfor- 
- mation of a line (if x, ¢ are regarded as homogeneous coordinates) or as a special 
affine transformation of the plane (if x, ¢ are regarded as non-homogeneous codrdi- 
nates). Similarly (1) may be regarded either as the general projective transformation 
of a plane, or as a special affine transformation of space. 
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carries over any given point (a, ¢) into a point (2’, t’) whose position 
depends on the values of the constants «, 8, y, 6. Our theorem is 
true if it is possible to find one, and, except for a constant factor 
which may be introduced throughout, only one, ‘set of seven con- 
stants — four, «, 8, y, 6, and three others, p,, p,, ps, none of which is 
zero — which satisfy the six equations 


Pit =a, + Pty, [pyre =e, + Bly, [pgr3 = 423 + Ble, 
Pity = YX, + Oty [Poth =YX_t Sty, — [Pats = Xs + Oty. 


Since the z’s and ?@’s are all known, we have here six homogeneous 
linear equations in seven unknowns. Hence there are always solu- 
tions other than zeros, the number of independent ones depending on 
the rank of the matrix of the coefficients. Transposing and rear- 
ranging the equations, we have 


ta+tB — 2p; = 0, 
ayy + t,6 — tip, =U, 
ye + t8 — UP» =, 
way + 1,8 — tp, =0, 
gt + ts . — pz = 0, 
Lay + 30 — tp, =0. 


The matrix of these equations is of rank six. For consider the 
determinant of the first six columns with its sign reversed, 


ie te ae 
tee 4, OF 0 1 nn ek 
eS OD a OF 
0. Sra ar eae ee 
Zaye > hes) ee Oe 
0 «0 ee Ate 0 


Since P,, P,, Ps; are distinct, there exist two constants C4, Ca) 
neither of which is zero, such that 


ey, + ot, + 2,= 0, 


Ct) +t, +t, =0. 
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Hence, adding to the fifth row of D ¢, times the first row and c, times 
the second, and to the sixth row e¢, times the third row and ¢, times 
the fourth, we have 














OE WE Stee OAS a VE Sid 
bei, 0: aa) a 
ie 0 Otte t, 4 O t,t, |? | a af, 
ee ee ee Ae rity in|? 
2 2 2 Volo 1 4 
0 0 O 0 ae Ope), 
eee 0. <0 Oct sae, 


and this is not zero, since P| and P} are distinct as well as des 
and P,. ‘ 

In the same way we see that the determinants obtained by 
striking out the sixth and the fifth columns respectively of the 
matrix are not zero. Accordingly, by Theorem 4, §17, we see that 
the equations have a solution in which none of the quantities 
Pi Po Ps are zero, and that every solution is proportional to this 
one. All these solutions obviously yield the same projective trans- 
formation of the line. 


CoroLLary. The transformation just determined is non-singular. 


This follows, by a reference to Theorem 1, from the fact that it 
does not carry P,, P,, P; into a single point. 


EXERCISES 


1. Discuss singular projective transformations in one, two, and three dimen- 
sions; noting, in particular, the effect of the rank of the matrix of the transfor- 
mation, first, on the distribution of the points which have no corresponding points 
after the transformation, and secondly, on the distribution of the points into which 
no points are carried over by the transformation. 


2. Prove that any four complanar points no three of which are collinear may 
be carried over into any four points in the plane, no three of which are collinear, 
by one and only one collineation. 


3. State and prove the corresponding theorem in n dimensions. 


4. Prove that the transformation from a first system of homogeneous coérdi- 
nates to a second is effected by a non-singular linear transformation. Consider 
the case of one, two, and three dimensions. 
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5. Prove that a projective transformation in space effects on every plane a 
two-dimensional, and on every line a one-dimensional, projective transformation, 
while at the same time the positions of the plane and line are changed. 


[Sucerstion. If p and p/ are any two corresponding planes, assume in any way a 
pair of perpendicular axes in each of them, and denote by (@,, y,, ¢,), and (a/, y/, ¢ 
respectively the systems of two-dimensional homogeneous coérdinates based on these 
axes. Then show, by using the result of Exercise 4, that the transformation of one 
plane on the other will be expressed by writing «{, y/, t{ as homogeneous linear poly- 
nomials in 2,, Y) ¢-] 


25. Further Development of the Algebra of Matrices. We proceed 
to establish certain further properties of matrices, leaving, however, 
much to the reader in the shape of exercises at the end of the section. 

The theory of linear transformations suggests to us at once certain 
properties of matrices. The first of these is : 


THEOREM 1. The matrix 


ae ONr ee 0 
‘(areal Wace 0 

I= 
G40 eee 1. 


has the property that if a is any matrix whatever 
Ia=al=a. * 


For the linear transformation of which a is the matrix will evi- 
dently not be changed by being either followed or preceded by the 
identical transformation of which I is the matrix. 

If we do not wish to use the idea of linear transformation, we may 
prove the theorem directly by actually forming the products Ia and al. 

This theorem tells us that I plays in the algebra of matrices the 
same role that is played by 1 in ordinary algebra. For this 
reason I is sometimes called the writ matrix or tdemfactor. 

Let us now consider any non-singular linear transformation and 
its inverse. These two transformations performed in succession in 
either order obviously lead to the identical transformation. This 
gives us the theorem : 


THEOREM 2. If Ay °** Un 
eee ee 


Any *** Ann 
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is a non-singular matrix of determinant a, and if A,; denote in the ordi- 
nary way the cofactors of the elements of a, the matrix 





Ate An 
a a 
9 
A jn A nn 
aoe 


called the inverse of a, and denoted by a-1,is a non-singular matrix 
which has the property that 


aa t=ala=I. 


This suggests that we define positive and negative integral 
powers of matrices as follows : 


DEFINITION 1. Jf p is any positive integer and a any matrix we 
understand by a? the product aa --- a to p factors. If ais a non- 
singular matrix, we define its negative and zero powers by the for- 
mule 

a?=(aly, av=l 
From this definition we infer at once 
THEOREM 3. The laws of indices 
aPat — arta, (a?) = qr 
hold for all matrices when the indices p and q are positive integers, and 


for all non-singular matrices when p and q are any integers. 


We turn now to the question of the division of one matrix by 
another. We naturally define division as the inverse of multiplica- 
tion, and, since multiplication is not commutative, we thus get two 
distinct kinds of division ; a divided by b being on the one hand a 
matrix x such that A 


on the other hand a matrix y such that 
a=yb. 


On account of this ambiguity, the term division is not ordinarily used 
here. We have, however, as is easily seen, the following theorem : 
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Turorem 4. Jf ais any matric and b any non-singular matrix, 
there exists one, and only one, matrix x which satisfies the equation 


a= Dx, 
and one, and only one, matrix y which satisfies the equation 
a=yb, 
and these matrices are given respectively by the formule 
x =b-a, yeab = 


A special class of matrices is of some importance ; namely, those 
of the type 


fone GO; 7-228 30 
Ok - 0 
Or 0! ak 


Such matrices we will call scalar matrices for a reason which will 
presently appear. 

If we denote by k the scalar matrix just written, and by a any 
matrix of the same order as k, we obtain readily the formula 


(1) ka = ak = ka. 


If now, besides the scalar matrix k, we have a second scalar matrix 
lin which each element in the principal diagonal is 7, we have the 
two formule 


(2) k+1l=1+k=(k+))1, 
(3) kl = 1k = XII. 


Formula (1) shows that scalar matrices may be replaced by ordinary 
scalars when they are to be multiplied by other matrices; while 
formule (2) and (8) show that scalar matrices combined with one 
another not only obey all the laws of ordinary scalars, but that each 
scalar matrix may in such cases ,.be replaced by the scalar which 
occurs in each element of its principal diagonal provided that at the 
end of the work the resulting scalar be replaced by the correspond: 
ing scalar matrix. 


LINEAR TRANSFORMATIONS AND MATRICES ce 


For these reasons we may, in the algebra of matrices, replace all 
scalar matrices by the corresponding scalars, and then consider that 
all scalars which enter into our work stand for the corresponding 
scalar matrices. If we do this, the unit matrix I will be represented 
by the symbol 1. 


DEFINITION 2. By the adjoint A of a matrix a is understood 
another matrix of the same order in which the element in the ith row 
and jth column is the cofactor of the element in the jth row and ith 
column of a.* 


It will be seen that when a is non-singular, 
(4) A= aa, 


but it should be noticed that while every matrix has an adjoint, only 
non-singular matrices have inverses. 
Equation (4) may be written in the form 


(5) Aa=aA =al, 


a form in which it is true not merely when a is non-singular, but also, 
as is seen by direct multiplication, when the determinant of a is zero. 

Finally we come to a few important theorems concerning the 
rank of the matrix obtained by multiplying together two given 
matrices. In the first place, we notice that the rank of the product 
is not always completely determined by the ranks of the factors. 
This may be shown by numerous examples, for instance, in formula 
(5), § 22, the ranks of the factors are in general two and one, and the 
rank of the product is zero, while in the formula 


G1, Gy. 0 ORO AO 0 0 -a% 
te Go) Wi 0 0 A Nis) O70 do 
Bay Ge, 0 OOO OM OFT ag 


the ranks of the factors are in general the same, namely two and 
one, while the rank of the product is one. 

But though, as this example shows, the ranks of the factors (even 
together with the order of the matrices) do not suffice to determine 
the rank of the product, there are, nevertheless, important inequali- 
ties between these ranks, one of which we now proceed to deduce. 


* Notice the interchange of rows and columns here, which in the case of adjoint 
determinants, being immaterial and sometimes inconvenient, was not made, 
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For this purpose consider the two matrices 


Oy ein by +++ Stn 


Any °** Ong, Day = bin 


and their product ab. 


THEOREM 5. Any k-rowed determinant of the matrix ab is equal 
to an aggregate of k-rowed determinants of b each multiplied into a 
polynomial in the a’s, and also to an aggregate of k-rowed determinants 
of a each multiplied by a polynomial in the b’s. 


For any k-rowed determinant of ab may be broken up into a sum 
of determinants of the &th order in such a way that each column of 
each determinant has one of the 0’s as a common factor.* After 
taking out these common factors from each determinant, we have 
left a determinant in the a’s which, if it does not vanish identically, is 
a k-rowed determinant of a. Or, on the other hand, we may break 
up the k-rowed determinant of ab into a sum of determinants of -the 
kth order in such a way that each row of each determinant has one 
of the a’s as acommon factor. After taking out these common factors 
from each determinant, we have left a determinant in the 6’s which, 
if it does not vanish identically, is a k-rowed determinant of b. 

From the theorem just proved it is clear that if all the k-rowed 
determinants of a or of b are zero, the same will be true of all the 
k-rowed determinants of ab. Hence 


Turorem 6. The rank of the product of two matrices cannot 
exceed the rank of either factor. 


* The truth of this statement and the following will be evident if the reader 
actually writes out the matrix ab. 

+ Thus if 7, and rz are the ranks of the two factors and # is the rank of the prod- 
uct, we have RSn, R<rz. This is one half of Sylvester’s ‘‘ Law of Nullity,” of 
which the other half may be stated in the form R>r; + 72 — 7, where n is the order 
of the matrices ; cf, Exercise 8 at the end of this section. Sylvester defines the nullity 
of a matrix as the difference between its order and its rank, so that his statement of 
the law of nullity is: The nullity of the product of two matrices is at least as great 
as the nullity of either factor, and at most as great as the sum of the nullities of the 
factors. 
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There is one important case in which this theorem enables us to 
determine completely the rank of the product, namely, the case in 
which one of the two matrices a or b is non- singular. Suppose first 
that a is non-singular, and denote the ranks of b and ab by rand R 
respectively. By Theorem 6, R<r. We may, however, also regard 
b as the product of a+ and ab, and hence, applying Theorem 6 
again, we have r< #. - Combining these two results, we see that 
Felts 

On the other hand, if b is non-singular, and-we denote the ranks 
of a and ab respectively by r and R, we get from Theorem 6, R<7r; 
and, applying this theorem again to the equation 


(ab) bt = a, 


we haver<R. Thus again we get r= R. 
We have thus established the result : 


THEOREM 7. Jf a matrix of rank ris multiplied in either order by 
a non-singular matrix, the rank of the product is also r. 


EXERCISES 


1. Prove that a necessary and sufficient condition that two matrices a and b 
of the same order be equivalent is that there exist two non-singular matrices 
ec and d such that hoes pb: 


Cf. § 22, Exercise 2, and § 19, mee cise 4. 


2. Prove that a necessary and sufficient condition that two matrices a and b 
of the same order be equivalent is that there exist four matrices c, d,e, f such 
= dac =p; a = fbe. 

3. Prove that every matrix of rank r can be written as the sum of r matrices 
of rank one.* 


[Suceustion. Notice that the special matrix mentioned i in § 19, Exercise 3, can be 
so written. ] 


* A matrix of rank one has been called by Gibbs a dyad, since it may (cf. 
§ 19, Ex. 5) be regarded as a product of two complex quantities (a, a2, +++ @,) and 
(b1, be, «++ bn). The sum of any number of dyads is called a dyadic polynomial, or 
simply a dyadic. Fvery matrix is therefore a dyadic, and vice versa. Gibbs’s theory 
of dyadics, in the case n = 3, is explained in the Vector Analysis of Gibbs-Wilson, 
Chap. V. Geometric language is used here exclusively, the complex quantities 
(a1, G2, a3) and (61, be, b3) from which the dyads are built up being interpreted as 
vectors in space of three dimensions. ‘This theory is equivalent to Hamilton’s theory 
of the Linear Vector Function in Quaternions. 
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4. Prove that a necessary and sufficient condition that a matrix be a divisor 
of zero (cf. § 22, Exercise 1) is that it be singular. 


[Sucerstion. Consider equivalent matrices. ] 


5. Prove that the inverse of the product of any number of non-singular 
matrices is the product of the inverses of these matrices taken in the reverse 
order. 

Hence deduce a similar theorem concerning the adjoint of a product of any 
number of matrices, whether these matrices are singular or not. 

What theorem concerning determinants can be inferred ? 


6. Prove that the conjugate of the inverse of a non-singular matrix is the 
inverse of the conjugate; and that the conjugate of the adjoint of any matrix 
is the adjoint of the conjugate. 


7. Prove that if a matrix has the property that its product with every matrix 
of the same order is commutative, it is necessarily a scalar matrix. 


8. If ri and r, are the ranks of two matrices of order n, and R the rank of 


their product, prove that REA 4 ry — ne 


[Suecestion. Prove this theorem first on the supposition that one of the two 
matrices which are multiplied together is of the form mentioned in Exercise 3, § 19, 
using also at this point Exercise 1, § 8. Then reduce the general case to this one by 
means of Exercise 1 of this section. | 


26. Sets, Systems, and Groups. These three words are the 
technical names for conceptions which are to be met with in all 
branches of mathematics. In fact the first two are of such gener- 
ality that they may be said to form the logical foundation on which 
all mathematics rests.t In this section we propose, after having 
given a brief explanation of these three conceptions, to show how 
they apply to the special subjects considered in this chapter. 

The objects considered in mathematics — we use the word object 
in the broadest possible sense — are of the most varied kinds. We 
have, on the one hand, to mention a few of the more important ones, 
the different kinds of quantities ranging all the way from the posi- 
tive integers to complex quantities and matrices. Next we have in 
geometry not only points, lines, curves, and surfaces but also such 


* Cf. the footnote to Theorem 6. 

+ For a popular exposition of the point of view here alluded to, see my address on 
The Fundamental Conceptions and Methods of Mathematics, St. Louis Congress of 
Arts and Science, 1904, Reprinted in Bull. Amer. Math. Soc., December, 1904. 
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things as displacements (rotations, translations, etc.), collineations, 
and, in fact, geometrical transformations in general. Then in vari- 
ous parts of mathematics we have to deal with the Theory of 
Substitutions, that is, with the various changes which can be made 
in the order of certain objects, and these substitutions themselves 
may be regarded as objects of mathematical study. Finally, in 
mechanics we have to’ deal with such objects as forces, couples, 
velocities, etc. 

These objects, and all others which are capable of mathematical 
consideration, are constantly presenting themselves to us, not singly, 
but in sets. Such sets (or, as they are sometimes called, classes) of 
objects may consist of a finite or an infinite number of objects, or 
elements. We mention as examples: 

(1) All prime numbers. 

(2) All lines which meet two given lines in space. 

(8) All planes of symmetry of a given cube. 

(4) All substitutions which can be performed on five letters. 

(5) All rotations of a plane about a given line perpendicular to it. 

Having thus gained a slight idea of the generality of the con- 
ception of a set, we next notice that in many cases in which we have 
to deal with a set in mathematics, there are one or more rules by 
which pairs of elements of the set may be combined so as to give 
objects, either belonging to the set or not as the case may be. As 
examples of such rules of combination, we mention addition and 
multiplication both in ordinary algebra and in the algebra of ma- 
trices; the process by which two points, in geometry, determine a 
line; the process of combining two displacements to give another 
displacement, etc. 

Such a set, with its associated rules of combination, we will call 
a mathematical system, or simply a system.* 

We come now to a very important kind of system known as a 
group, which we define as follows : 


* This definition is sufficiently general for our immediate purposes. In general, 
however, it is desirable to admit, not merely rules of combination, but also relations be- 
tween the elements of a system. In fact we may have merely one or more relations 
and no rules of combination at all. From this point of view the positive integers with 
the relation of greater and less would form a system, even though we do not introduce 
any rule of combination such as addition or multiplication. It may be added that rules 
of combination may be regarded as merely relations between three objects; cf. the 
address referred to above. 

G 
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DEFINITION. A system consisting of a set of elements and one rule 
of combination, which we will denote by 0, is called a group of the follow- 
ing conditions are satisfied : 

(1) If a and b are any elements of the set, whether distinct or not, 
aob is also an element of the set.* 

(2) The associative law holds ; that is, if a, b, e are any elements 


of the set, (aob)oe=ao(boe). 


(3) The set contains an element, t, called the identical element, 
which is such that every element is unchanged when combined with it, 


Lod=a40t=U. 


(4) If a ts any element, the set also contains an element a’, called the 
inverse of a, such that ee ee es 

Thus, for example, the positive and negative integers with zero 
form a group if the rule of combination is addition. In this case 
zero is the identical element, and the inverse of any element is its 
negative. These same elements, however, do not form a group if 
the rule of combination is multiplication, for while conditions (1), 
(2), and (8) are fulfilled (the identical element being 1 in this case), 
condition (4) is not, since zero has no reciprocal. 

Again, the set of all real numbers forms a group if the rule 
of combination is addition, but not if it is multiplication, since in 
this case zero has no inverse. If we exclude zero from the set, we 
have a group if the rule of combination is multiplication, but not if 
it is addition. 

As an example of a group with a finite number of elements we 
mention the four numbers 


hes 1) acs ete aN eee 


with multiplication as the law of combination. 

In order to get an example of a group of geometrical operations, 
let us consider the translations of a plane, regarded as a rigid lamina, 
in the directions of its own lines. Every such translation may be 
represented both in magnitude and in direction by the length and 


* A system satisfying condition (1) is sometimes said to have ‘‘the group prop- 
erty.’’? In the older works on the subject this condition was the only one to be 
explicitly mentioned, the others, however, being tacitly assumed, 
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direction of an arrow lying in the plane in question. Two such 
translations performed in succession are obviously equivalent to a 
translation of the same sort represented by the arrow obtained by 
combining the two given arrows according to the law of the paral- 
lelogram of forces. The set of all translations with the law of 
combination just explained is readily seen to form a group if we 
include in it the null translation, t.e. the transformation which leaves 
every point in the plane fixed. This null translation is then the 
identical element, and two translations are the inverse of each other 
if they are equal in magnitude and opposite in direction. 

All the groups we have so far mentioned satisfy, not only the 
four conditions stated in the definition, but also a fifth condition, 
viz. that the law of combination is commutative. Such groups are 
called commutative or Abelian groups. In general, however, groups 
do not have this property. As examples of non-Abelian groups, 
we may mention first the group of all non-singular matrices of a 
given order, the rule of combination being multiplication ; and 
secondly the group of all matrices of a given order whose deter- 
minants have the value +1, the rule of combination being again 
multiplication. This second group is called a subgroup of the first, 
since all its elements are also elements of the first group, and the 
rule of combination is the same in both cases. A subgroup of the 
group last mentioned is the group of all matrices of a given order 
whose determinants have the value +1,* the rule of combination 
being multiplication. 

We add that non-Abelian groups may readily be built up whose 
elements are linear transformations, or collineations. On the other 
hand, Abelian groups may be formed from matrices if we take as our 
rule of combination addition instead of multiplication. 


27. Isomorphism. 


Derrnition. Two groups are said to be isomorphict tf it is possible 
to establish a one-to-one correspondence between their elements of such a 


* These are called unimodular matrices ; or, more accurately, properly unimodular 
matrices to distinguish them from the improperly unimodular matrices whose determi- 
nants have the value —1. It should be noticed that these last matrices taken by them- 
selves do not constitute a group, since they do not even have the group property. 

+ Simply isomorphic would be the more complete term. We shall, however, not 
be concerned with isomorphism which is not simple. 


84 INTRODUCTION TO HIGHER ALGEBRA 


sort that if a, b are any elements of the ‘first group and a’, b' the corre- 
sponding elements of the second, then a! ob! corresponds to aob.* 


We proceed to illustrate this definition by some examples, 
leaving to the reader the proofs of the statements we make. In each 
case we omit the statement of the rule of combination in the case of 
transformations, where no misunderstanding is possible. 


First EXAMPLE. (a) The group of the four elements 
1 Ge PG SY 


the rule of combination being multiplication. 

(6) The group of four rotations about a given line through angles 
61, 0,90°,- 180%, 270%; 

These two groups may be proved to be isomorphic by pairing the 
elements against one another in the order in which they have just 
been written. 


SEcOND EXAMPLE. (a) The group of the four matrices 


1 0\ /-1 ON ea 0\ /-1 0 
(0 1), ( 0 _1), (0 _1),( 0 1), 
the rule of combination being multiplication. 

(6) The group of the following four transformations: the iden- 
tical transformation ; reflection in a plane; reflection in a second 
plane at right angles to the first ; rotation through 180° about the 
line of intersection of these two planes. 

(c) The group consisting of the identical transformation and of 
three rotations through angles of 180° about three straight lines 
through a point at right angles to one another. 

The two groups of Example 1 are not isomorphic with the three of 
Example 2 in spite of the fact that there are the same number of ele- 
ments in all the groups. This follows from the presence of two 


elements in the groups of Example 1 whose squares are not the 
identical element. 


* This idea of isomorphism may obviously be extended to the case of any two sys- 
tems provided merely that there are the same number of rules of combination in both 
cases. Thus the system of all scalar matrices on the one hand and of all scalars on the 
other, the rules of combination being in both cases addition and multiplication, are ob- 
viously isomorphic. It is for this reason that no confusion arises if no distinction is 
made between scalar matrices and scalars. 
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THIRD EXAMPLE. (a) The group of all real quantities; the rule 
of combination being addition. 

(6) The group ofall scalar matrices of order & with real ele- 
ments; the rule of combination being addition. 

(ec) The group of all translations of space parallel to a given 
line. 


FourtH Exampte. (a) The group of all non-singular matrices 
of order n, with multiplication as the rule of combination. 

(6) The group of all non-singular homogeneous linear transfor- 
mations in n variables. 


We might be tempted to mention as a group of geometrical trans- 
formations isomorphic with the last two groups, the group of all 
non-singular collineations in space of nm —1 dimensions. This, how- 
ever, would be incorrect, for the correspondence we have established 
between collineations and linear transformations is not one-to-one; 
to every linear transformation corresponds one collineation, but to 
every collineation correspond an infinite number of linear transfor- 
mations, whose coefficients are proportional to one another.* In 
order to get a group of geometrical transformations isomorphic with 
the group of non-singular matrices of the nth order it is sufficient 
to ipterpret the variables z,, --- x, as non-homogeneous codrdinates in 
space of m dimensions, and to consider the geometric transformation 
effected by non-singular homogeneous linear transformations of 
these z’s. These transformations are those affine transformations of 
space of m dimensions which leave the origin unchanged ; cf. the 
footnote on p. 70. Thus the group of all non-singular matrices of 
the nth order is isomorphic with a certain subgroup of the group 
of collineations in space of n dimensions, not with the group of all 
non-singular collineations in space of n — 1 dimensions. 

An essential difference between these two groups is that one 


* This does not really prove that the groups are not isomorphic, since it is con- 
ceivable that some other correspondence might be established between their elements 
which would be one-to-one and of such a sort as to prove isomorphism. Even the 
fact, to be pointed out presently, that the groups depend on a different number of 
parameters does not settle the question. A reference to the result stated in Exercise 7, 
§ 25, shows that the groups are not isomorphic ; for, according to it, the only non- 
singular collineation which is commutative with all collineations is the identical 
transformation, whereas all linear transformations with scalar matrices have this 


property. 
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depends on n* parameters (the n? coefficients of the linear transforma- 
tion) while the other depends only on n?—1 parameters (the ratios 
of the coefficients of the collineation). 

We can, however, by looking at the subject a little differently, 
obtain a group of matrices isomorphic with the group of all non- 
singular collineations in space of n—1 dimensions. For this purpose 
we need merely to regard two matrices as equal whenever the ele- 
ments of one can be obtained from those of the other by multiplying 
all the elements by the same quantity not zero. When we take this 
point of view with regard to matrices, it is desirable to indicate it 
by a new terminology and notation. According to a suggestion of 
E. H. Moore of Chicago, we will call such matrices fractional 
matrices, and write them 


My AQ %g 





ieee Wi) 
n M4 
oe Mo, Ugg Ag ||, ete. 

















ee Onis Cea Ses 

Agreeing that fractional matrices are to be added and mul- 
tiplied according to the same rules as ordinary matrices, we may now 
say that the group of all non-singular collineations in space of » —1 
dimensions is isomorphic with the group of all fractional matrices of 
the nth order whose determinants are not zero.* 

To take another example, the groups in the second example above 
are isomorphic with the group whose elements are the four fractional 
matrices 

: 


and where the law of combination is multiplication. These four 
matrices, if regarded as ordinary matrices, would not even satisfy 
the first condition for a group. 

The reader wishing to get a further insight into the theory of 
groups of linear transformations will find the following three treat- 


ett 
List) 


Ok 
—1 0 








’ ’ 








a 
in a 











com 























* Tt should be noticed that we cannot speak of the value of the determinant of a 
fractional matrix unless this value is zero, for if we multiply all the elements of the 
matrix by c we do not change the matrix, but do multiply the determinant by c. 
There is in particular no such thing as a unimodular fractional matrix. We may, 
however, speak of the rank of a fractional matrix. 
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ments interesting and instructive. They duplicate each other to 
only a very slight extent. 

Weber, Algebra, Vol. II. 

Klein, Vorlesungen tiber das Tkosaeder. 

Lie-Scheffers, Vorlesungen tiber continuirliche Gruppen. 


EXERCISES 


1. DeFIniTIon. A group is said to be of order n if it.contains n, and only n, 
elements. 


If a group of order n has a subgroup, prove that the order of this subgroup is 
a factor of n. 


[Sucerstion. Denote the elements of the subgroup by a, -:- ay, and let b be any 
other element of the group. Show that bay, bas, --- ba, are all elements of the group 
distinct from each other and distinct from the q’s. If there are still other elements, 
let c be one and consider the elements cay, -++ ca,, etc. ] 


2. Prove that if a is any element of a group of finite order, it is possible by 
multiplying a by itself a sufficient number of times to get the identical element. 


DeFinition. The lowest power to which a can be raised so as to give the identical 
element is called the period of a. 


3. Prove that every element of a group of order n has as its period a factor 
of n (1 and n included). 


4. Derinition. A group is called cyclic if all its elements are powers of a 
single element. 

Prove that all cyclic groups of order n are isomorphic with the group of rota- 
tions about an axis through angles 0, w, 2, --- (n—1) w, where w= 2 7/n, and 
that conversely every such group of rotations is a cyclic group. 


5. Prove that every group whose order is a prime number is a cyclic group. 


6. Prove that all groups of order 4 are either cyclic or isomorphic with the 
groups of the second example above. <A group of this last kind is called a fours 


group (Vierergruppe). 


7. Obtain groups with regard to one or the other of which all groups of 
order 6 are isomorphic. 


8. Obtain groups with regard to one or the other of which all groups of 
order 8 are isomorphic. 


CHAPTER VII 
INVARIANTS. FIRST PRINCIPLES AND ILLUSTRATIONS 


28. Absolute Invariants, Geometric, Algebraic, and Arithmetical. 
If we subject a geometric figure to a transformation, we find that, 
while many properties of the figure have been altered, others have 
not. If we consider, not a single transformation, but a set of trans- 
formations, then those properties of figures which are not changed by 
any of the transformations of the set are said to be invariant prop- 
erties with regard to this set of transformations. Thus if our set of 
transformations is the group of all displacements, the property of 
two lines being parallel or perpendicular to each other and the 
property of a curve being a circle are invariant properties, since 
after the transformation the lines will still be parallel or perpendicu- 
lar and the curve will still be a circle. If, however, we consider, 
not the group of displacements, but the group of all non-singular 
collineations, none of the properties just mentioned will be invariant 
properties. Properties invariant with regard to all non-singular 
collineations have played such an important part in the development 
of geometry that a special name has been given to them, and they 
are called projective or descriptive properties. As examples of such 
projective properties we mention the collinearity and complanarity 
of points, the complanarity and concurrence of lines, etc.; or, on 
the other hand, the contact of a line with a curve or a surface or 
the contact of two curves or of two surfaces, or of a curve and a 
surface. 


DeFIniTion 1. Jf there ts associated with a geometric figure a 
quantity which is unchanged by all the transformations of a certain 
set, then this quantity is called an invariant with regard to the trans- 
formations of the set. 


For instance, if our set of transformations is the group of dis- 
placements, the distance between two points and the angle between 
two lines would be two examples of invariants. 

88 
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The geometric invariants so far considered lead up naturally to 
the subject of algebraic invariants. Thus let us consider the two 


olynomials 
“iy A+ Byt+ C, 
A,x+ Bay + OC, 


and subject the variables (x, y) to the transformations of the set 
(2) a’ =xcos6+ysind + a, 
y' = —xsind+ ycos6+8, 


where «, 8, are parameters which may have any values. The trans- 
formation (2) carries over the polynomials (1) into two new poly- 
nomials: 
(3) ee + Biy' +O, 

Aja! + Bhy' + Ch. 


The coefficients of (3) may be readily expressed in terms of the coeffi- 
cients of (1) and the parameters a, 8, 0. Using these expressions, 
we easily obtain the formule 


(4) AR Al Bi AR, — A,B, 
AA! + BIB! = A,A, + B,By 


We shall therefore speak of the two expressions 
(5) A,B,—A,B,, A,A,+B,B, 


as invariants of the system of polynomials (1) with regard to the set 
of transformations (2) according to the following general definition : 


DEFINITION 2. If we have a system of polynomials in the variables 
(x, Y, 2 +...) and a set of transformations of these variables, then any 
Sunetion of the coefficients of the polynomials is called an invariant (or 
more accurately an absolute invariant) with regard to these transforma- 
tions if it is unchanged when the polynomials are subjected to all the 
transformations of the set. 

The relation of the example considered above to the subject of 
geometric invariants becomes obvious when we notice that the alge- 
braic transformations (2) may be regarded as expressing the dis- 
placements of plane figures in their plane when (z, y) are rectangular 
coordinates of points in the plane. If now we consider, not the poly- 
nomials (1), but the two lines determined by setting them equal to 
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zero, we have to deal with the displacements of these two lines. 
The invariants (5) have themselves no geometric significance, but by 
equating them to zero, we get the necessary and sufficient conditions 
that the two lines be respectively parallel and perpendicular, and 
these, as we noticed above, are invariant properties with regard to 
displacements. Finally we may notice that the ratio of the two in- 
variants (5) gives the tangent of the angle between the lines,—a 
geometric invariant. 

As a second example, let us consider, not two lines, but a line and 
a point. Algebraically this means that we start with the system 


Been 


6 
(°) (2, Y1)s 


consisting of a polynomial and a pair of variables. We shall wish 
to demand here that whenever the variables (a, y) are subjected to a 
transformation, the variables (x,, y,) be subjected to the same trans- 
formation, or as we say according to Definition 3 below, that (2, y) 
and (2, y,;) be cogredient variables. If we subject the system (6) ” 
to any transformation of the set (2), we get a new system 
(1) ae C, 
(py Yi)» 

and it is readily seen that 

Algit+ Blyi t+ W=Axz,+ By, + C. 


Accordingly we shall call Az, + By, + Ca covariant of the system (6) 
according to Definition 4 below. This covariant has also no direct 
geometric meaning, but its vanishing gives the necessary and suffi- 
cient condition for an invariant property, namely, that the point 
(a, y1) lie on the line Ar + By+ C=0. 

In the light of this example we may lay down the following gen- 
eral definitions : 


DEFINITION 3. Jf we have several sets of variables 
(©) Ys dy (yr Yas 29 Yo (ay Yay Ran oe )y oe 


and agree that whenever one of these sets is subjected to a transfor- 
mation every other set shall be subjected to the same transformation, 
then we say that we have sets of cogredient variables. 
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DEFINITION 4. If we have a system consisting of a number of poly- 
nomials in (x, y, 2,---) and of a number of sets of variables co- 
gredient to (x, y, 2,---), then any function of the coefficients of the 
polynomials and of the cogredient variables which is unchanged when 
the variables (x, y, 2, -+-) are subjected to all the transformations of a 
certain set is called a covariant (or more accurately an absolute cova- 
riant) of this system with regard to the transformations of this set. 


It will be seen that invariants may be regarded as special cases of 
covariants. 

Among the geometric invariants there are some which from their 
nature are necessarily integers, and which we will speak of as arith- 
metical invariants. An example would be the number of vertices of 
a polygon if our set'of transformations was either the group of dis- 
placements or the group of non-singular collineations. Another ex- 
ample is the largest number of real points in which an algebraic 
curve can be cut by a line, if our set of transformations is the group 
of real non-singular collineations. 

These arithmetical invariants also play, as we shall see, an impor- 
tant part in algebra. We mention here as an example the degree of an 
n-ary form, which isan invariant with regard to all non-singular linear 
transformations. * 

EXERCISES 


1 Yi 1 
1. Prove that (x2—21)?+ (y2—4y1)?, and 


x2 y2 1 
ta ys 1 








are covariants of the system 
(x1, Y1)5 (22, Y2)s (%3, Ys) 


with regard to the transformations (2). 
2. Prove that A+B8 and B?- AC 
are invariants of the polynomial 
Az? +2 Bry+ Cy?+2Dx+2Hy+F 
with regard to the transformations (2). 
What geometric meaning can be attached to these invariants? 


3. Prove that A? + B? is an invariant of the polynomial 


Az+By+C 
with regard to the transformations (2). 
Hence show that Am +By+C 
VA? + B? 


is a covariant of the system (6). Note its geometric meaning. 


*It is, in fact, an invariant with regard to all linear transformations except the 
one in which all the coefficients of the transformation are zero, 
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29. Equivalence. 

Derinition 1. Jf A and B denote two geometric configurations 
or two algebraic expressions, or sets of expressions, then A and B shall 
be said to be equivalent with regard to a certain set of transformations 
when, and only when, there exists a transformation of the set which car- 
ries over Ainto B and also a transformation of the set which carries over 
B into A. 

To illustrate this definition we notice that the conception of equiv- 
alence of geometric figures with regard to displacements is identical 
with the Euclidean conception of the equality or congruence of fig- 
ures. , 

Again, we see from Theorem 2, § 24, that on a straight line two 
sets of three points each are always equivalent with regard to non- 
singular projective transformations. 

In both of the illustrations just mentioned the set of transforma- 
tions forms a group. In such cases the condition for equivalence can 
be decidedly simplified, for the transformation which carries A into 
B has an inverse belonging to the set, and this inverse necessarily 
carries B into A. Thus we have the 


THEOREM. A necessary and sufficient condition for the equiva- 
lence of A and B with regard to a group of transformations is that a 
transformation of the group carry over A into B. 

This theorem will be of great importance, as the question of 
equivalence will present itself to us only when the set of transfor- 
mations we are considering forms a group. 

Let us consider, for the sake of greater definiteness, a group of 
geometric transformations. If two geometric configurations are 
equivalent with regard to this group, every invariant of the first 
configuration must be equal to the corresponding invariant of the 
second. Thus, for instance, if two triangles are equivalent with re- 
gard to the group of displacements, all the sides and angles of the 
first will be equal to the corresponding sides and angles of the second. 
The same will be true of the altitudes, lengths of the medial lines, 
radius of the inscribed circle, etc., all of these being invariants. 
Now one of the first problems in geometry is to pick out from among 
these invariants of the triangle as small a number as possible whose 
equality for two triangles insures the equivalence of the triangles. 
This can be done, for instance, by taking two sides and the included 
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angle, or two angles and the included side, or three sides. Any one 
of these three elements may be called a complete system of invariants 
for a triangle with regard to the group of displacements, since two 
triangles having these invariants in common are equivalent and 
therefore have all other invariants in common. The conception we 
have here illustrated may be defined in general terms as follows : 


DEFINITION 2. A set of invariants of a geometric configuration or an 
algebraic expression are said to form a complete system of invariants if 
two configurations or expressions having these invariants in common are 
necessarily equivalent.* 


Tt will be seen from this definition that all the invariants of a 
geometric configuration or of an algebraic expression are uniquely 
determined by any complete system of invariants. 

Finally we will glance at an application to matrices of the ideas 
of invariants and equivalence. Let us consider matrices of the nth 
order,f and consider transformations of the following form which 
transform the matrix A into the matrix B: 


(1) ; aAb = B, 


where a and b are any non-singular matrices of the nth order. This 
transformation may be denoted by the symbol (a, b), and these sym- 
bols must obviously be combined by the formula 


(ag) by) (ay, Dy) = (aay, byb,)- 


By means of this formula it may readily be shown that these trans- 
formations form a group. 

According to our general definition of equivalence, two matrices 
A and B must therefore be said to be equivalent when, and only 
when, two non-singular matrices a and b exist which satisfy (1). 
That this definition of equivalence amounts to the same thing as our 
earlier definition is seen by a reference to Exercise 1, § 25. 


* In the classical theory of algebraic invariants this term is used in a different and 
much more restricted sense. There we have to deal with integral rational relative inva- 
riants (cf.§ 31). By a complete system of such invariants of a system of algebraic forms 
is there understood aset of such invariants in terms of which every invariant of this sort 
of the system of forms can be expressed integrally and rationally. Cf. for instance 
Clebsch, Bindre Formen, p. 109. 

+ We may, if we choose, confine our attention throughout to matrices with real 
elements. 
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30. The Rank of a System of Points or a System of Linear Forms 
as an Invariant. Let (a, y,. 2, t;), (er Yor Za ba)s (a: Yor 2: bg) DC 
any three distinct collinear points, so that the rank of the matrix 


Mn 4 
T Yn % by 
UY, % ty 
is two. Now subject space to a non-singular collineation and we get 
three new points which will also be distinct and collinear, and hence the 
rank of their matrix will also be two. Thus we see that in this special 
case the rank of the system of points is unchanged by a non-singular 
collineation. 
Again, let aye + by + e2+d,t =0, 
At + boy + 62+ dot = 0, 
age + bey + c,2 + dat = 0, 
ae + by + e+ d,t=0 


be any four planes which have one, and only one, point in common, 
so that the rank of their matrix is three. After a non-singular collin- 
eation we have four new planes which will also have one, and only 
one, point in common, and hence the rank of the matrix of their 
coefficients will be three. The rank of this system of planes is 
therefore unchanged by such a transformation. 

‘We proceed now to generalize these facts. 


THEOREM 1. The rank of the matrix of m points 
oe : é 
(oll, aff, -.. af), (i=1, 2, --. m) 
is an invariant with regard to non-singular linear transformations. 
Let X= CyXy He Hy Las 
-) Se eee 
XxX, = nye + 0 FE Can Ln 
be a non-singular linear transformation which carries the points 
(zi, ++ vf) over into the points (X{!, --. XM). Now suppose any & 
of the points (2, --- al), which for convenience we will take as the 


first k, are linearly dependent. Then there exist k constants 
(¢,, +++ &) not all zero, such that 


(2) C0) + coe! ++. + ec) = 0, (f= 1, 2a 2). 


| 
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By means of the transformation (1) we have 
xt = eat! + siete +4 eal, 
hence ¢,X}+¢,X/’+--+¢,XW=c, (qa + eel! +--+ +e) + 
++ Gin (Cyan + Coty + ++ +00) Gj =1,2, ---n). 

Since this vanishes on account of (2), the first & of the points 
(Xf, --- X™) are linearly dependent. Since (1) is a non-singular 
transformation, it is immaterial which set of points we consider as the 
initial set. Thus we have shown that if any & points of either set 
are linearly dependent, the corresponding & points of the other set 
will be, also. 

Now if the rank of the matrix of the z’s is 7, at least one set of 7 
of the z-points is linearly independent, but every set of (r+ 1) of 


them is linearly dependent. Consequently the same is true for the 
X-points, and therefore their matrix must also be of rank 7. 


THEOREM 2. The rank of the matrix of m linear forms 
Filey 7 Dy S Ay L, + UyLgt os + Vink G= hs 2, Pala? mM) 
is an invariant with regard to non-singular linear transformations. 


The proof of this theorem, which is very similar to the proof of 
Theorem 1, we leave to the reader. 


It will be noticed that the invariants we have considered in this 
section are examples of what we have called arithmetical invariants. 


31. Relative Invariants and Covariants. We will begin by con- 
sidering a system of m linear forms in n variables 
Oy Py TF Ay BH ve + Ayn Ens 
(1) Any LF Ugg Vy + +++ + Gon Uns 


Lay BF Ang Ly Hes +a Ene 
DEFINITION 1. The determinant 


O44 ae Ain 


Any “Ann 


is called the resultant of the system (1). 
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Let us now subject the system (1) to the linear transformation 
Y= C1 a ae Cit 
(2) de 4 0 
In = Cny V4 + +++ + Cap Ths 
This gives the new system of forms 
Oy v a es Bin Tn 
(3) ea ae din & 
yy By ++ + Aan Try 
where Oy = iy Cyz + Ain Cy g + ++ + Gin, Cnge 


From these formule and the law of multiplication of matrices we 


7 if ne elk 

infer that O44 °° Gn A414 °** Gn C147" Cn 

() Be ae Bee 
DME cae! a a Cry 2G 
m1 ° mM nmi ~-° “nn n1 °°° “nn 


This result we state as follows: 


THEOREM 1. Jf a system of n linear forms in n variables with 
matrix ais subjected to a linear transformation with matrix c, the re- 
sulting system has as its matrix ac. 


Taking the determinants of both sides of (4), we see that the re- 
sultant of (1) is not an absolute invariant. It is, however, changed 
in only a very simple manner by a linear transformation, namely, by 
being multiplied by the determinant of the transformation. This 
leads us to the following definition : 


DEFINITION 2. A rational function * of the coefficients of a form or 
system of forms which, when these forms are subjected to any non- 
singular linear transformation, 1s merely multiplied by the wth power 
(u an integer +) of the determinant of the transformation is called a rela- 
tive invariant of weight w of the form or system of forms.t The forms 
themselves are called the ground forms. 


* Besides these rational invariants we may also consider irrational ones (cf. § 90), 
in which case the exponent u will not necessarily be an integer. 

+The condition that ~ be an integer need not be included as a part of our 
hypothesis, since it may be proved. The proof that ~ cannot be a fraction is simple. 
The proof that «4 cannot be irrational or imaginary would take us outside of the domain 
of algebra. 

t From this definition it is clear that every relative invariant is an absolute invariant 
with regard to the group of linear transformations of determinant +1. Cf. Exercise 7, §81. 
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It will be seen that absolute invariants are simply relative in- 
variants of weight zero. 


The fact pointed out above concerning the resultant may now be 
stated in the following form : 


THEOREM 2. The resultant of a set of n linear forms in n variables 
is a relative invariant of weight 1. 


We pass on now to relative covariants : 


DEFINITION 8. Jf we have a system consisting of a number of n-ary 
forms and of a number of points (Yy, +++ Yn)s (By +++ 2n)y eee the codr- 
dinates of each of which are cogredient with the variables (x,,...£,) of 
the forms, then any rational function of the coefficients of the forms and 
the codrdinates of the points which is merely multiplied by the pth power 
(u an integer) of the determinant of the transformation when the x's are 
subjected to any non-singular linear transformation is called a relative 
covariant of weight wu of the system of forms and points.* 


We may regard an invariant as the extreme case of a covariant 
where the number of points is zero. The other extreme case is that 
in which the number of forms is zero. Here we have the theorem: 


THEOREM 3. The determinant 


if 


| 
Ly SOO Am 


ffl... gl 


is a relative covariant of weight — 1 of the system of points 
iz, “iiss a, (a4, is a), te (oP, vas ey. 
For applying the transformation 
y= Cy Xt + CX 


. . 


Ly == Cyy XY ge eos Can ns 


* In most books where the subject of covariants is treated, the same letters 
(a1, + + + %) are used for one of the points as for the variables of the forms. There is 
no objection to this, and it is sometimes convenient, We prefer to use a notation 
which shall make it perfectly clear that the variables of the forms have no connection 
with the codrdinates of the points except that they are cogredient with them. 

H 
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we have 
/ 


we gl f 1 1 1 
wy Oy, | fey XE be Hy XR 1 Cp Ky ott + Can Xp 


~ 


aft] Bae ail C4 Xn +o + Cyn X/ “+ Ong Xx, fe wee 603 ae 


a / 7 
ene: Cigs| 2 iakaeseXe 


Cny + Cnn x? are Xi) 
Or DE, + X/ Cy °°" C1, at x sgh 
== ’ 
{7 ators Xr Cn nisi Con ay) oe gil 


as was to be proved. 
Another extremely simple case arises when we have a single 
form and a single point: 


TuHeorEM 4. The system consisting of the form f (2, ---x,) and 
the point (yy +** Yn) has as an absolute covariant with regard to linear 
transformations 

I Yn “i Yn): 


For let us denote f more explicitly as 
FT (Gy Ags °° 5 By eo), 
where dj, a, --- are the coefficients of f. If the coefficients after the 
transformation are a}, a}, ---, we have 
F Cal, diy 5 aly DY SS (ayy dg 3 By Be)> 
This being true for all values of the 2’s, will be true if the 2’s are 


replaced by the y’s. But when this is done, the 2’’s will be replaced 
by the y’’s, since the 2’s and y’s are cogredient. Accordingly 


SF (hy Way 3 Yh Yn) =H (Agy May 2293 Yay Yn)s 


as was to be proved. 
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The three examples of invariants and coyariants which have been 
given in this section are all polynomials in the coefficients of the 
forms and the coérdinates of the points. Such invariants we shall 
speak of as integral rational invariants and covariants.* 


THEOREM 5. The weight of an integral rational invariant cannot 
be negative. 


Let a, a, «++ 3 6,,b55--- 3 +++ be the coefficients of the system of 
forms, and let ¢, be the coefficients of the transformation. It is clear 
that the coefficients a}, a},---; 64, 65,---;. --- after the transformation are 
polynomials in the a’s, 0’s, etc., and in the ¢,’s. Now let Z be an 
integral rational invariant of weight yp, 


if ! E | ! . . . 
I(a}, Any °° 5 bi, uss sisiei 3 a) Pad Glog **+ 5 b,, Bayes +), 


where ¢ is the determinant of the transformation. Suppose now 
that » were negative, w= —v. Then 


(5) OT (al, 2+ 5 Diy oot 3 + a= T (yy + 5 Byy oe 3 +) 


This equality, like the preceding one, is known to hold for all 
values of the ¢,’s for which ¢e +0. Hence, since the expressions on 
both sides of the equality are polynomials in the a’s, 0’s, --- and the 
¢,;8, we infer, by an application of Theorem 5, § 2, that we really 
have an identity. 

Let us now assign to the a’s, 6’s, --- any constant values such that 
Ti G,,-->; 6,,---3---)#0. Then I(a},---3 5{,---3---) will be a poly- 
_ nomial in the ¢,’s alone, which, from (5), cannot be identically zero. 
The identity (5) thus takes a form which states that the product of 
two polynomials in the ¢;,’s is a constant, and since the first of these 
polynomials, ¢’, is of higher degree than zero, this is impossible. 

We will agree in future to understand by the terms znvariant and 
covariant, invariants or covariants (absolute, relative, or arithmetical) 
with regard to all non-singular linear transformations. If we wish to 
consider invariants or covariants with regard to other sets of trans- 
formations, for instance with regard to real linear transformations, 
this fact will be explicitly mentioned. 


* All rational invariants and covariants may be formed as the quotients of such as 
are integral and rational ; cf. Exercises 4, 5, § 78. 
+ It cannot be zero either; cf. Theorem 5, §79. 
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Finally, let us note the geometric meaning to be associated with 
the invariants and covariants which have been mentioned in this 
section. We confine our attention to the case of four variables. 
The vanishing of the resultant of four linear forms gives a necessary 
and sufficient condition that the four planes determined by setting 
the forms equal to zero meet in a point. The vanishing of the co- 
variant of Theorem 3 is a necessary and sufficient condition that the 
four points lie in a plane. The vanishing of the covariant of 
Theorem 4 is a necessary and sufficient condition that the point 
(Y1 Yo. Yar Ya) lie on the surface f=0. It will be seen that in all 
cases we are thus led to a projective property ; cf. §$80, 81. 


32. Some Theorems Concerning Linear Forms. 

THEOREM 1. Two systems of n linear forms in n variables are 
equivalent with regard to non-singular linear transformations if neither 
resultant is zero. 


Let CE i a ee 6152 +--+ + On ¥n 
oe ee Crean 
Any Ly + +++ + Ann ®n ie ae oe 
be the two systems, whose resultants, 
hy 17+ Ayn bi Din 
a= ; b= ; 
Any? Ang Ons wae bee 


are, by hypothesis, not zero. Applying the transformations 


jee Pee 
Dy = Ay 2 Seas ots Vn&ns ea by 2, ‘eos bin®ny 
1s oe 
Ly = Any Vy + +++ FAnnEns ty = bX ee oe a 


to (1) and (2) respectively, they are both reduced to the normal form 
Bs 
(3) Wy 


Now, since neither a nor 0 is zero, the transformations a and b have 
inverses, which when applied to (3) carry it back into (1) and (2) 
respectively. Hence the transformation bla carries (1) into (2). 
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THEOREM 2. A system of n linear forms in n variables has no 
integral rational invariants * other than constant multiples of powers 
of the resuitant. ; 

Let (1) be the given system and a its resultant, and let ¢ be the 
determinant of a non-singular linear transformation which carries 
(1) over into 


[alal t+ aaa 
cf) tenes a: 
AU, + tana. 
If we call the resultant of (4) a’, we have 
a! = ae. 


Let I( ay. -+- G,) be any integral rational invariant of the system 
(1) of weight pu, and write 
Des D(a, <= @,,,)s 
Then Leck. 


Now let us assume fora moment that a #0, and consider the special 
transformation which carries over (1) into the normal form (3). In 
this special case we have a’ =1; hence, as may also be seen directly, 
ac=1. Calling the constant value which J’ has in this particular 

h 
case k, we have Pe eT. 


or 
(5) T= kat. 


This equality, in which & is independent of the coefficients a, has 
been established so far merely for values of the a,’s for which a #0. 
Since p» is not negative (cf. Theorem 5, § 31), we can now infer 
that (5) is an identity, by making use of Theorem 5, § 2. Thus we 
see that Jis merely a constant multiple of a power of the resultant, 
as was to be proved. 


CoroLLARy. A system of m linear forms in n variables has no in- 
tegral rational invariants (other than constants) when m < Nn. 

For such an invariant would also be an integral rational invariant 
of the system of n linear forms obtained by adding n — m new forms to 
the given system; and hence it would be a constant multiple of a 


* Tt has the arithmetical invariant mentioned in Theorem 2, § 30. 
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power of the resultant of this system. This power must be zero, 
and hence the invariant must be a mere constant, as otherwise it 
would involve the coefficients of the added forms, and hence would 
not be an invariant of the system of original forms. 


EXERCISES 


1. Prove that if we have two systems of n + 1 linear forms in n variables whose 
matrices are both of rank n, a necessary and sufficient condition that these two 
systems be equivalent with regard to non-singular linear transformation is that 
the resultants of the forms of one set taken n at a time be proportional to the 
resultants of the corresponding forms of the other set. 


2. Generalize the preceding theorem. 


3. Prove that every integral rational invariant of a system of m linear forms in 
n variables (m > n) is a homogeneous polynomial in the resultants of these forms 
taken n at a time. 


4. State and prove the theorems analogous to the theorems of the present sec- 
tion, including the three preceding exercises, when the system of linear forms is re- 
placed by a system of points. 


33. Cross-ratio and Harmonic Division. Let us consider any four 
distinct points on a line 


(1) (24, ty), (%a ty), (5, ty), (yy ty)» 


We have seen, in $31, Theorem 3, that each of the six determinants 
(2) 


Lyly — Loly, Lyle — Lol, Lt, — Lyt1, 
Lely — Lylgy Lgl — Lot yy Lolz — Ugly, 


is a covariant of weight —1. The ratio of two of these determinants 
is therefore an absolute covariant, and we might be tempted, by 
analogy with the examples of absolute covariants in Exercise 1, § 28, 
to expect that it might have a geometric meaning. It will be readily 
seen, however, that this is not the case, for the value of the ratio of 
two of the determinants (2) will be changed if the two codrdinates 
of one of the four points are multiplied by the same constant, and 
this does not affect the position of the points. 

It is easy, however, to avoid this state of affairs by forming such 
an expression as the following :* 





L,t,— Lot) (Cet, — Lyt 
(3) (1, o 3, 4) =! 12 att) (®aty ats) 
(@gt3— Ugly) (yt — 2 yt) 
* The reversal of sign of the second factor in the denominator is not essential, but 
is customary for a reason which will presently be evident. 
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which is also an absolute covariant of the four points (1), and is called 
their eross-ratio or anharmonic ratio. More accurately it is called the 
cross-ratio of these four points when taken in the order written in (1).* 

In order to determine the geometric meaning of the cross-ratio 
- of four points, let us first suppose the four points to be finite so that 
tytotst,#0. Dividing numerator and denominator of (1, 2, 3, 4) by 
this eodaet we find the following expression for the cross-ratio in 
terms of the non-homogeneous coérdinates X; of the points, 


(X, a X;) (X, ee X4) 


(4) kant 6. aae Ale Hee Ai 





Finally, denoting the points by P,, P,, P;, P,, we may write 


Pipe ge 4 P,Ps 
I Tig ee HEN 





_ P,P, 
(5) (1, 2, 8, 4)= 545" 


In words, this formula tells us that the cross-ratio of four finite 
points is the ratio of the ratio in which the second divides the first 
and third and the ratio in which the fourth divides the first and 
third; and that it is also the ratio of the ratios in which the first 
and third divide the second and fourth. 

In this statement, it must be remembered that we have taken the 
ratio in which ( divides the points A, Bas AC/ BC, so that the ratio 
is negative if (divides AB internally, positive if it divides it externally. 

If we agree that the point at infinity on a line shall be said to 
divide any two finite points A, B of this line in the ratio +1 (and this 
is a natural convention since the more distant a point the more nearly 
does it divide AB in the ratio +1) it is readily seen, by going back 
to formula (3), that the first statement following (5) still holds if the 
second or fourth point is at infinity, while the second statement holds 
if the first or third is at infinity. Thus we have in all cases a simple 
geometric interpretation of the cross-ratio of four distinct points. 

The special case in which four points P,, P,, Ps, P, are so situ- 
ated that (1, 2, 8,4)= —1 is of peculiar importance. In this case 
we have 
(1, 2, 3, 4)=(1, 4, 3, 2)= (8, 2, 1, 4) = (3, 4, 1, 2)=(2, 1, 4, 8) 

= (2,3, 4, 1)==(4, 1; 2,3) =—(4, 3,2; Dj — 


* Tf these four points are taken in other orders, we get different cross-ratios : 
(1, 2, 4, 8), (1, 4, 3, 2), ete. Cf. Exercise 1 at the end of this section 


t 
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The relation is therefore merely a relation between the two pairs of 
points P,, P,; and P,, P, taken indifferently in either order, and we 
say that these two pairs of points divide each other harmonically. 
From the geometric meaning of cross-ratio, we see that, if all four 
points are finite, the pairs P,, P, and P,, P, divide each other har- 
monically when, and only when, P, and P, divide P,, P, internally 
and externally in the same ratio; and also when, and only when, Vee 
and P, divide P,, P, internally and externally in the same ratio. If 
P, or P, lies at infinity, the first of these statements alone has a 
meaning, while if P, or P, lies at infinity, it is the second statement 
to which we must confine ourselves. 

It is easily seen that the case in which three of the four points, 
say P,, P,, P;, coincide, while P,is any point on the line, may be 
regarded as a limiting form of two pairs of points which separate one- 
another harmonically. It is convenient to include this case under the 
term harmonic division, and we will therefore lay down the definition: 


DEFINITION. Two pairs of points P,, P, and P,, P, ona line are 
said to divide one another harmonically if they are distinct and their 
eross-ratio taken in the order P,, P,, P3, P, is —1, and also ¢ at least 
three of them coincide. 


It will be seen that the property of two pairs of points dividing 
each other harmonically is a projective property in space of one 
dimension. 

The most important applications of cross-ratio come in geometry 
of two, three, or more dimensions where the points are not determined 
as above by two codrdinates (or one non-homogeneous codrdinate), but 
by more. Suppose, for instance, we have four distinct finite points 
on a line in space of three dimensions. Let the points be P,, P,, 
@1, Ga, and suppose the codrdinates of P,, P, are (x,, y,, 2 t,) and 
(Xa) Yor 2g) t,) respectively. Then the codrdinates of @,, Q, may be 
written 


(x, H+ Aq, Yy + AYos 24 + AZ 6, + Ate), (a, HF MX, Yy + MY gy 21 + M2, ty + pty). 
Now, let 

(6) Av + By+ (z+ Dt=0 

be any plane through Q, but not through P,, and we have 


(Ax, + By, + Cz, + Dt) + MAx, + By, + Cz, + Dt) = 0, 
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or, since P, does not lie on (6), 


Ax, + By, + C2, + Dt, _ 


Az, + By, + Cz, + Dt, ee 





Changing to non-homogeneous coérdinates, we have 


AX, + BY,+ CZ,+ D — yf 
AX,+BY,+CZ,4+D  “# 





If P,M, and PM, are the perpendiculars from P, and P, on the 
plane (6), we have 
PG Pi, ARE BY CZ PD » 2. 
P,Q, P.M, AX,+BY,+ 0Z,+D — t, 





In exactly the same way we get 


PiQ,_ ts 


yo Mb e 
P 2s ty 


Consequently PQ, / P19, _ 
PoQ@1/ Pols 





7 
be 


This is the cross-ratio of the four points taken in the order P,, Q,, 
aa 

It is readily seen that if one of the two points Q, or Q, lies at in- 
finity, all that is essential in the above reasoning remains valid, and 
the cross-ratio is still A/p. 

The case in which one of the two points P, or P, is at infinity 
may be reduced to the case just considered by writing for the codr- 
dinates of Q, and Q,, (&) 7p» % 7,) and (&, 1) 6 7). The codrdi- 
nates of P, and P, are then 


(é, = “ty Die 


(g, — £5) 11 — No» 61 — Sa) T1 — Tp) 


xr Xr nr 
pl oT Pac) n—*n) 


Accordingly, from what has just been proved, we see that the 
cross-ratio of the four points taken in the order Q,, P,, Q, P, is /u. 
But this change of order does not change the cross-ratio. Hence in 
all cases we have the result: 
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THEOREM 1. The cross-ratio of the four distinct points 


Py (Xp Yy 2p t)s 
P, (Xp. Yor 22 by)s 
QQ, (@y + ayy YH AYas %1 + A2qy ty + Ab), 
Qa (@y + Malay Yt Yay 2 + Mey by + Mty)s 
taken in the order P 1, Q1, Po Qy 18 X/M- 
From this theorem we easily deduce the further result : 


THEOREM 2. The cross-ratio of four points on a line is invariant 
with regard to non-singular collineations of space.* 


For the four points P,, P,, Q;, Q, of Theorem 1 are carried over 
by a non-singular collineation into the four points 


Pr (as Vp BB) 
PL (@y Yo % t)s 
Qi (ay H Aah YW HAYS 24 + Az, th + AZ), 


OQ (@ + Hey Yh + HY me % + Mee, H+ ote), 
whose cross-ratio, when taken in the order P}, Q}, P}, Qj, is also X/p. 


Theorems similar to Theorems 1 and 2 hold in space of two, and in 
general in space of n, dimensions and may be proved in the same way. 


EXERCISES 
1. Denote the six determinants (2) by 
= (1,2); Clas); (1, 4), (3, 4), (4, 2), (2, 3), 


and write 


A=(1,2)(3,4), B=(,3)(4,2), C=(,4)(, 3). 


Prove that six, and only six, cross-ratios can be formed from four points by 
taking them in different orders, namely the negatives of the six ratios which can be 
formed from A, B, C' taken two and two. 


2. Prove that A + B+ C=O, and hence show that if X is one of the cross- 
ratios of four points, the other five will be 


1 1 A-1 rX 
ine 1-2, SS hE Nera © 
r 1-2 r A-1 


* This also follows from Exercise 5, § 24. 
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3. Prove that the six cross-ratios of four distinct points are all different from 
each other except in the following two cases: 

(a) The case of four harmonic points, where the values of the cross-ratios 
are —1, 2, 4. 

(8) The case known as four equianharmonic points, in which the values of 
the cross-ratios are —}+3/_ 3. 


4. Prove Theorem 2, § 24, by making use of the fact that the cross-ratio of four 
points on a line is unchanged by non-singular projective transformations of the 
line. ; 

5. By the cross-ratio of four planes which meet in a line is understood the 
cross-ratio of the four points in which these planes are met by any line which 
does not meet their line of intersection. 

Justify this definition by proving that if the equations of the four planes are 

Pi = 9, py + Ap. = 9, po = 0, py + pp, = 0 


(p, and p, homogeneous linear polynomials in 2, y, 2, t), the cross-ratio of the four 
points in which any line which does not meet the line of intersection of the planes 
is met by the planes is X/ pm. 


6. Prove that the cross-ratio of four planes which meet in a line is invariant 
with regard to non-singular collineations. 


34. Plane-Codrdinates and Contragredient Variables. If w,, u,, 
Ug, Us, are Constants, and x, %, Zz, x, are the homogeneous coérdinates 
of a point in space, the equation 
(1) Uyey + Ugg + Uys + Uyr, = 0 
represents a plane. Since the values of the w’s determine the posi- 
tion of this plane, the w’s may be regarded as codrdinates of the plane. 
We will speak of them as plane-codrdinates, just as the x’s (each set 
of which determines a point) are called point-codrdinates. And just 
as we speak of the point (2, 2, %3, %,) So we will speak of the plane 
(Uy, Uy Uz, Uy). The w’s are evidently analogous to homogeneous 
codrdinates in that if they be all multiplied by the same constant, 
the plane which they determine is not changed. 

Suppose now that we consider the 2’s as constants and allow the 
u’s to vary, taking on all possible sets of values which, with the fixed 
set of values of the z’s, satisfy (1). This equation will now repre- 
sent a family of planes, infinite in number, each one of which is de- 
termined by a particular set of values of the w’s and all of which pass 
through the fixed point (2,, v7, %3, 4). ‘The equation (1) may there- 
fore be regarded as the equation of a point m plane-codrdinates, since 
it is satisfied by the codrdinates of a moving plane which envelops 
this point, just as when the a’s. vary and the w’s are constant, it is 
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the equation of a plane in point codrdinates, since it is satisfied by the 
codrdinates of a moving point whose locus is the plane.* 

In the same way, a homogeneous equation of degree higher than 
the first in the w’s will be satisfied by the codrdinates of a moving 
plane which will, in general, envelop a surface. The equation will 
then be called the equation of this surface in plane-codrdinates. f 

Let us now subject space to the collineation 


c Vj C540 + Cigly + Cigty + Ci4Xy @=1, 2, 3, 4). 


We will assume that the determinant ¢ of this transformation is not 
zero ; and we will denote the cofactors in this determinant by C;;. 
Then the inverse of the transformation c may be written 


ot y= Gat + Cates + Satan 4 Cte (¢=1, 2, 3, 4). 
Cc 

Substituting these expressions, we see that the plane (1) goes over 

into 


(2) ayes + Une, + Uses + Uyx), = 0, 
where 


We thus see that the w’s have also suffered & linear transformation, 
though a different one from the 2’s, namely, the transformation whose 
matrix is the conjugate (cf. § 7, Definition 2)ofc-. This transforma- 
tion d of the plane-codrdinates is merely another way of expressing 
the collineation which we have commonly expressed by the transfor- 
mation c of the point-codrdinates. The two sets of variablesxand % 
are called contragredient variables according to the following 


Derinition 1. Two sets of n variables each are called contragre- 
dient if, whenever one is subjected to a non-singular linear transformation, 
the other is subjected to the transformation which has as its matrix the 
conjugate of the inverse of the matrix of the first. 


* Similarly, in two dimensions, the equation 
W4H14 + Ugly + Usxzg = 0 


represents a line in the point-codrdinate (a, %, wg) if a1, we, us are constants, or a 
point in the line-coérdinates (w1, W2, us) if «1, %2, #3 are constants. 
{+ An example of this will be found in § 53. 
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Precisely the reasoning used above in the case of four variables 
establishes here also the theorem : 


- 


THEOREM.* Jf the two sets of contragredient variables Diy 0 Be 
and U,,--- U, are carried over by a linear transformation into Ge aoe 
and wu}, --- u}, then 

Uy + Ug®a + +++ + UnLy, 
¥ 7 niall: Ve a 
will go over into Unk, + Upto + --- + U,27,. 


In connection with this subject of contragredient variables it is 
customary to introduce the conception of contravariants, just as the 
conception of covariants was introduced in connection with the sub- 
ject of cogredient variables. For this purpose we lay down the 


DEFINITION 2. Jf we have a system of forms in (x, + x) and 
also a number of sets of variables, (u,--- wh), (uf, Wh), «++ contra- 
gredient to the x's, any rational function of the ws and the coefficients of 
the forms which is unchanged by a non-s'‘ngular linear transformation of 
the x's except for being multiplied by the uth power (4 an integer) of the 
determinant of this transformation is called a contravariant of weight w. 


Thus the theorem that the resultant of m linear forms in n variables 
is an invariant of weight 1 may, if we prefer, be stated in the form: 
If we have mn sets of m variables each, (uj, --- w/,), «+» (wll, --- ul), 
each of which is contragredient to the variables («,, --- x), the de- 
terminant of the w’s is a contravariant of weight 1.+ 

It will be ‘seen that the conception of contravariant, though 
sometimes convenient, is unnecessary, since the contragredient vari- 
ables may always be regarded as the coefficients of linear forms, and, 
when so regarded, the contravariant is merely an invariant. 

Similarly, the still more general conception of mixed concomitants, 
in which, besides the coefficients of forms and the contragredient 
variables, certain sets of cogredient variables are involved, t reduces 
to the familiar conception of covariants if we regard the contra- 
gredient variables as coefficients of linear forms. 


* This is really a special theorem in the theory of bilinear forms. Cf. the next 
chapter. 

+ For other examples of contravariants in which coefficients also occur, see 
Chap. XII. 

¢ An example is ua + wove + +++ + Une, the theorem above stating that this is an 
absolute mixed concomitant. 
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35. Line-Coérdinates in Space. A line is determined by two 
points (445 Yor Ya Ya)» (21) 299 2g) 2,) Which lie on ‘it. It is clear that 
these eight codrdinates are not all necessary to determine the line ; 
and it will be seen presently that the following six quantities deter- 
mine the line completely, and may be used as line-codrdinates, 


Pie Piz Piss Psa Pao» Pos 
Yi Y; : 


2; &; 


where 


(1) Per 








In other words, the p’s are the two-rowed determinants of the matrix 


¥ Ye Ys Ya 
ye 


9 














except that the sign of the determinant obtained by striking out 
the first and third column has been changed. These six p’s are not 
all zero if, as we assume, the two points y and z are distinct. 

These six p’s are connected by the relation 


(2) PrPss + PisP12 + PrPos = 9,* | 
as may be seen either directly or by expanding the vanishing 
determinant Ve Ya Ya 

PL NE 

$i 02. 308 4 


@ @ *, %& 


by Laplace’s method in terms of the minors of the first two rows. 
That the p’s may really be used as line-codrdinates is shown by 
the following two theorems: 


THEOREM 1. When a line is given, its line-coérdinates p,; are com- 
pletely determined except for an arbitrary factor different from zero by . 
which they may all be multiplied. 

The definition (1) of the p’s shows that they may all be multi- 
plied by an arbitrary factor different from zero without affecting 
the position of the line, since the y’s (and also the 2’s) may be multi- 
plied by such a factor without affecting the position of the point. 


* Cf. Exercise 2, § 33. 
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In order to prove our theorem it is sufficient to show that, if 
instead of the two points used above for determining the p’s we use 
two other points of the line, 


(Vp Yn Ys Ys, (4 4 Zy %)s 


the line-codrdinates Fay, 





thus determined will be proportional to the p,’s. Since the points 
Y and Z are collinear with the distinct points y, z, they are linearly 
dependent upon them and we may write 


Y= CLYi + Cy Xin A ky yi ap key; (= 1, 2, 3, 4). 
According! Ge 
sly Pyle 2 Ya _ Ip, 
1 “e|l*%1 











where K#0,as Y and Z are distinct points. 


THEOREM 2. Any six constants p,; satisfying the relation (2) and 
not all zero are the line-codrdinates of one, and only one, line. 


That they cannot be the codrdinates of more than one line may be 
seen as follows: Suppose the p,;’s to be the codrdinates of a line, 
and take two distinct points y and zg on the line. The codrdinates 
of these points may then be so determined that relations (1) hold. 
Let us suppose, for definiteness, that p,,+0.* Now, consider the 
point whose coordinates are ¢,y;+¢,2, By assigning to ¢, and c, 
first the values — 2, and y,, then the values — 2, and y,, we get the 
two points 
(3) (0; Pia» Pris» Prs)s (Parr 95 Pos» Pos)s 


where, by definition, py = — p,;- 

These two points are distinct, since for the first of them the first 
coordinate is zero and the second is not, while for the second, the 
second codrdinate is zero and the first is not. These points ac- 
cordingly determine the line, and since they, in turn, are deter- 
mined by the p’s, we see that the line is uniquely determined 
by the p’s. 

* By a slight modification of the formule this proof will apply to the case in 
which any one of the p’s is assumed different from zero. 
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It remains, then, merely to show that any set of p,,;’s, not all zero, 
which satisfy (2) really determine aline. For this purpose we again 
assume p,.+# 0* and consider the two points (8) which, as above, are 
distinct. The line determined by them has as its codrdinates 


Pi» PieP1397 PiPiy —P13P42— PisPox PiePs Pri2Pos- 


Using the relation (2), the fourth of these quantities reduces to 
P12 P34 80 that, remembering that the codrdinates of a line may be 
multiplied by any constant different from zero, we see that we 
really have a line whose codrdinates are p;;. ; 

In a systematic study of three-dimensional geometry these line- 
coordinates play as important a part as the point- or plane-coérdi- 
nates; and in the allied algebraic theories we shall have to consider 
expressions having the invariant property, into which these line- 
coordinates enter just as point-codrdinates occur in covariants and 
plane-codrdinates in contravariants. We may, if we please, regard 
these expressions as ordinary covariants, since the line-codrdinates 
are merely functions of the codrdinates of two points, but the co- 
variants we get in this way are covariants of a special sort, since the 
codrdinates of the two points occur only in the combinations (1). 

As an example, let us consider four points 


(Wis Yar 2p ts) (i=1, 2, 8, 4). 


The determinant of these sixteen codrdinates is, by Theorem 3, § 31, 
a covariant of weight —1. Let us denote by pj and pi the codrdi- 
nates of the lines determined by the first two and the last two points 
respectively. Expanding the four-rowed determinant just referred 
to by Laplace’s method according to the two-rowed determinants of 
the first two rows, we get 


(4) Pi2P34 + PioPhs + PisPie + PisPi2 + PiaPos + Pls Pos: 
This, then, is an expression having the invariant property and in- 
volving only line-codrdinates. 

Since the vanishing of the four-rowed determinant from which 
we started gave the condition that the four points lie in a plane, it 
follows that the vanishing of (4) gives a necessary and sufficient 
condition that the two lines p’ and p'’ lie in a plane, or, what 
amounts to the same thing, that they meet in a point. 


* By a slight modification of the formule, this proof will apply to the case in 
which any one of the p’s is assumed different from zero. 
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EXERCISES 


1. Prove that, if the point-codrdinates are subjected to the linear transfor- 


mation j 
Li = CX + Cyo%o + CigTg + Ci4X4 (i =1, 2, 3, 4), 


the line-codrdinates will be subjected to the linear transformation 
Pé = (Cate — Ceep) pre + (Cacjs — Css) Pig + (Cacjs — Cie) Pra + (CisCja — Ci4Cya) Daa 


+ (e032 — Ci2C;4) Pao + (Cintg — CigCj2) Pose 


2. A plane is determined by three points 
(1; Y2> Y3> Ys), (215 22 23, 24)5 (1, We, W3, W4). 


Prove that the three-rowed determinants of the matrix of these three points may 
be used as co6rdinates of this plane, and that these coérdinates are not distinct 
from the plane-coérdinates defined in § 34. 


3. A line determined by two of its points may be called a ray, and the line- 
coérdinates of the present section may therefore be called ray-coérdinates. A line 
determined as the intersection of two planes may be called an azis. If (11, ua, 
uz, Us) and (v4, v2, Vs, V4) are two planes given by their plane-coérdinates, discuss 
the azis-coérdinates of their intersection, 


12) 18, G14, 934, 429. ea, 
where Qij = UV; — Ujd;- 
4. Prove that ray-coérdinates and axis-codrdinates are not essentially differ- 


ent by showing that, for any line, the q’s, taken in the order written in Exercise 3, 
are proportional to the p’s taken in the order 


P34, P42, P23, P12, 13) P14- 


5. A point is determined as the intersection of three planes 
(4, U2, U3, us), (“1 Vay U3, v4), (“1, W2, W3, W4). 


Prove that the three-rowed determinants of the matrix of these planes may be 
used as coérdinates of this point, and that they do not differ from the ordinary 
point-coérdinates. 

Hence, show that all covariants, may be regarded as invariants. 


CHAPTER VIII 
BILINEAR FORMS 


36. The Algebraic Theory. Before entering on the study of 
quadratic forms, which will form the subject of the next five chapters, 
we turn briefly to a very special type of quadratic form in 2 varia- 
bles, known as a bilinear form, and which, as its name implies, forms 
a natural transition between linear and quadratic forms. 


DeFinition 1. A polynomial in the 2n variables (a, --- 2), 
(Yp *** Yn) 18 called a bilinear form if each of its terms is of the first 
degree in the 2’s and also of the first degree in the y’s. 


Thus, for n = 3, the most general bilinear form is 
Uy XY TF Uy Yo F M31 Y3 
TH Ag %Y 1 F Age%oYq TF Un3%oYg 


TH Ag1%gY1 1 AgoVgYq + Agg¥3/3- 


This may be denoted, for brevity, by Says; and, in general, we 
may denote the bilinear form in 27 variables by 


n 
(1) Easily 
The matrix iy, °° By 
a= 
ye ae 


is called the matrix of the form (1); its determinant, the determinant 
of the form; and its rank, the rank of the form.* <A bilinear form 
is called singular when, and only when, its determinant is zero. 


* It should be noticed that the bilinear form is completely determined when its 
matrix is given, so there will be no confusion if we speak of the bilinear form a. If 
two bilinear forms have matrices a; and ag, their sum has the matrix aj+a2. The 
bilinear form whose matrix is a,a2 is not the product of the two forms, but is sometimes 
spoken of as their symbolic product. 
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Let us notice that the bilinear form (1) may be obtained by 
starting from the system of m linear forms in the y’s of matrix a, 
multiplying them respectively by 2%, Yq **+ Zp, and adding them 
together. It can also be obtained by starting from the system 
of m linear forms in the 2’s whose matrix is the conjugate of a, 
multiplying them aaah yh by Ypy Yo: Yo and adding them 
together. 

Using the first of ee two methods, we see (cf. Theorem 1, § 31) 
that if the y’s are subjected to a linear transformation with matrix 
d, the bilinear form is carried over into a new bilinear form whose 
matrix is ad. Using the second of the above methods of building 
up the bilinear form from linear forms, we see that if the z’s are 
subjected to a linear transformation with matrix c, we get a new 
bilinear form the conjugate of whose matrix is a'c, where accents 
are used to denote conjugate matrices. The matrix of the form 
itself is then (cf. Theorem 6, § 22) c’a.* 

Combining these two facts, we have 


THEorREM 1. Jf, in the bilinear form (1) with matrix a, we subject 
the as to a linear transformation with matrix c and the y's to a linear 
transformation with matrix a, we obtain a new bilinear form with matria 
clad, where c! is the conjugate of c. 


Considering the determinants of these matrices, we may say: 


THEOREM 2. The determinant of a bilinear form is multiplied by 
the product of the determinants of the transformations to which the x’s 
and y’s are subjected.t 


We also infer from Theorem 1, in combination with Theorem 7, 
§ 25, the important result: + 


THeEorEM 8. The rank of a bilinear form is an invariant with re- 
gard to non-singular linear transformations of the x's and y’s. 


DEFINITION 2. A bilinear form whose matrix is symmetric ts 
called a symmetric bilinear form. 


* These results may also be readily verified without referring to any earlier theorems. 
+ This theorem tells us that the determinant of a bilinear form is, in a generalized 
' sense, a relative invariant. Such invariants, where the given forms depend on several 
sets of variables, are known as combinants. 

t This result may also be deduced from Theorem 2, § 30. 
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THeEoreM 4. A symmetric bilinear form remains symmetric if we 
subject the a’s and the y's to the same linear transformation. 


For if c is the matrix of the transformation to which both the 2’s 
and the y’s are subjected, the matrix of the transformed form will, 
by Theorem 1, be c’ac. Remembering that a, being symmetric, is 
its own conjugate, we see, by Theorem 6, § 22, that c’ac is its own 
conjugate. Hence the transformed form is symmetric. 


EXERCISES 


1. Prove that a necessary and sufficient condition for the equivalence of two 
bilinear forms with regard to non-singular linear transformations of the z’s and 
y’s is that they have the same rank. 


2. Prove that a necessary and sufficient condition that it be possible to factor 


a bilinear form into the product of two linear forms is that its rank be zero or one. 


3. Prove that every bilinear form of rank r ean be reduced by non-singular 
linear transformations of the 2’s and y’s to the normal form 


LY + UoYo+ ses +L Yy 
4. Do the statements in the preceding exercises remain correct if we confine 
our attention to real bilinear forms and real linear transformations? 
5. Prove that a necessary and sufficient condition that the form 
LY + Loy2-+ +++ + InYn 


should be unchanged by linear transformations of the 2’s and of the y’s is that 
these be contragredient transformations. 


37. A Geometric Application. Let (2, 72, 23) and (y, Ya, ys) be 
homogeneous codrdinates of points in a plane, and let us consider 
the bilinear equation 


: 3 
(1) 2g MY) = 0. 


If (445 Yoo Y3) iS a fixed point P, then (1), being linear in the 2’s, 
is the equation of a straight line p. The only exception is when the 
coefficients of (1), regarded as a linear equation in the a’s, are all 
zero, and this cannot happen if the determinant of the form is differ- 
ent from zero. Thus we see that the equation (1) causes one, and 
only one, line p to correspond to every point P of the plane, pro- 
vided the bilinear form in (1) is non-singular. 

Conversely, if 


(2) Ax, + Br, + Cr, =0 
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is a line p, there is one, and only one, point P which corresponds to it 
by means of (1), provided the bilinear form in (1) is non-singular. For 
if P is the point (y,, yy, ys), the equation of the line corresponding to it 
is (1), and the necessary and sufficient condition that this line coincide 
puivh (2) is AY 1 + Ayo + U3 = pA, 

AY 1 + Aan + Mogg = pB, 

A311 + Asoo + Asgi/3 = pO; 
where p is a constant, not zero. Fora given value of p, this set of 
equations has one, and only one, solution (y,, Y, yg), since the deter- 


minant @ is not zero, while a change in p merely changes all the y’s 
in the same ratio. Hence, 


THEOREM. [f the bilinear equation (1) is non-singular, tt establishes 
a one-to-one correspondence between the points and lines of the plane. 
This correspondence is called a correlation. 


EXERCISES 


1. Discuss the singular correlations of the plane, considering separately the 
cases in which the rank of the bilinear form is 2 and 1. 


2. Examine the corresponding equation in three dimensions, that is, the equa- 
tion obtained by equating to zero a bilinear form in which n= 4, and discuss it for 
all possible suppositions as to the rank of the form. 


3. Show that a necessary and sufficient condition for three or more lines, 
which correspond to three or more given points by means of a non-singular corre- 
lation, to be concurrent is that the points be collinear. 

4. Show that the cross-ratio of any four concurrent lines is the same as that of 
the four points to which they correspond by means of a non-singular correlation. 

5. Let P be any point in a plane and p the line corresponding to it by means 
of a non-singular correlation. Prove that a necessary and sufficient condition for 
the lines corresponding to the points of p to pass through P is that the bilinear 
form be symmetrical. 

6. State and prove the corresponding theorem for points and planes in space 
of three dimensions, showing that here it is necessary and sufficient that the form 
be symmetrical or skew-symmetrical.* 


* The correlation given by a symmetric bilinear equation is known as a reciproca- 
tion. By reference to the formule of the next chapter, it will be readily seen that in 
this case every point corresponds, in the plane, to its polar with regard to a fixed conic ; 
in space, to its polar plane with regard to a fixed quadric surface. The skew-symmetric 
bilinear equation gives rise in the plane merely to a very special singular correlation. 
In space, however, it gives an important correlation which is in general non-singular 
and is known as a nuill-system. Of. any treatment of line geometry, where, however, 
the subject is usually approached from another side. 


CHAPTER IX 


GEOMETRIC INTRODUCTION TO THE STUDY OF QUADRATIC 
FORMS 


38. Quadric Surfaces and their Tangent Lines and Planes. If 
21, Uy, Lz are homogeneous codrdinates in a plane, we see, by reference 
to §4, that the equation of any conic may be written 


2 2 2 9 ts 
OXF + Aggt2 + gg? 3 + 249% Lq + 2Ayg% Xz + 2Wyg%o%y = 0. 


Similarly, in space of three dimensions, the equation of any quadric 
surface may be written 
2 2 2 2 
Oy X]  Agn®S + Ugg tS + Agyth + 2ayor 2 + 2dygryrg + Baye, 


D) = 
4+ Qa g let, + 2A 45% ,2_ + 2Ay5%o%y = 0. 


This form may be made still more symmetrical if, besides the 
coefficients Ay, dys, Ay4, Aq) Ao U3, We introduce the six other con- 
stants y1, M31, A44, F43) Fog, Igq, defined by the general formula 


a; = Aji» 
The equation of the quadric surface may then be written 
2 
AyyXZ A MyM Vo HF AygXVq + AygVyX, 
2 
HF Ag Vly + AgyVZ 1 Aggly%y + AggVo%y 
2 

HF Ag Ugly + Agy%sVy + AgsZ + Aggy q 

j / 2 — 
H gy Vy®y + AgyMyly + AgyUyL3 + Ayr}? =O, 


or for greater brevity ; 


DEFINITION 1.— The matrix of the sixteen a’s taken in the order 
written above is called the matrix of the quadric surface (1), the deter- 
minant of this matrix is called the discriminant of the quadrie surface, 
its rank is called the rank of the quadric surface, and if the diserimi- 
nant vanishes, the quadric surface is said to be singular. 
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A fundamental problem is the following: If (y1, YY) y4) and 
(21, 2g) 2g, 24) are two points, in what points does the line yz meet the 
surface (1)? 

The coérdinates of any point on yz, other than y, may be written 


(2 + AY 2 + AYoy %y-+ AYsy 24 + AY4)- 


A necessary and sufficient condition for this point to lie on (1) is 


4 
aii, + y;) (2 + AY;) = 9, 
or expanded, , P 


If the point y does not lie on (1), this is a quadratic equation in 2. 
To each root of this equation corresponds one point where the line 
meets the quadric. Thus we see that every line through a point y 
which does not le on a quadric surface, meets this surface either in 
two, and only two, distinct points, or in only one point. 

On the other hand, if y does lie on (1), the equation (2) reduces to 
an equation of the first degree, provided La,y,z,40. In this case, 
also, the line meets the surface in two, and only two, distinct points, 
viz., the point y and the point corresponding to the root of the equa- 
tion of the first degree (2). 

Finally, if 2a,yy;=Za,y2=0, the first member of equation 
(2) reduces to a constant, so that (2) is either satisfied by no value 
of 2, in which case the line meets the surface at the point y only, 
or by all values of A (if 2a,,2,2; = 0), in which case every point on the 
line is also a point on the surface. 

Combining the preceding results we may say: 


THEOREM 1. Jf a quadric surface and a straight line are given, 
one of the following three cases must occur : 

(1) The line meets the quadric in two, and only two, points, in which 
case the line is called a secant. 

(2) The line meets the quadric in one, and only one, point, in which 
case it is called a tangent.* 

(3) Every point of the line is a point of the quadric. In this case 
the line is called a ruling of the quadric.} 

* We shall presently distinguish between true tangents and pseudo-tangents. 


+ Also called a generator, because, as will presently appear, the whole surface may 
be generated by the motion of such a line. 
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That all these three cases are possible is shown by simple exam- 

ples; for instance, in the case of the surface 
y+ 2 — 2 =0, 

the three coordinate axes illustrate the three cases. 

We shall often find it convenient to say that a tangent line meets 
the quadric in two coincident points. 

From the proof we have given of Theorem 1, we can also infer 
the further result : 

THEeorEM 2. Tf (15 Yos Y3s Ys) 18 & point on the quadrie (1), then if 


(3) Saye, =," 


every line through y is either a tangent or a ruling of (1), otherwise 
every line through y which lies in the plane 


4 
(4) Zag ti; =0 


is a tangent or ruling of (1), while every other line through y is a secant. 

A theorem of fundamental importance, which follows immediately 
from this, is: 

THEOREM 3. Jf there exists a point y on the quadric (1) such that 
the identity (3) 2s fulfilled, then (1) ts a cone with y as a vertex ; and, con- 
versely, Uf (1) ts a cone with y as a vertex, then the identity (3) is fulfilled. 

We pass now to the subject of tangent planes, which we define 
as follows: 

DEFINITION 2. A plane p is said to be tangent to the quadric (1) 
at one of its points P, if every line of p which passes through P is either a 
tangent or a ruling of (1). 

It will be seen that, according to this definition, if (1) is a cone, every 
plane through a vertex of (1) is tangent to(1) at this vertex. We have 
thus included among the tangent planes, planes which in ordinary 
geometric parlance would not be called tangent. The same objection 
applies to our definition of tangent Jines. We therefore now intro- 
duce the distinction between true tangent lines or planes and pseudo- 
tangent lines or planes. 


DEFINITION 3. A line or plane which touches a quadrie surface at 
a point which is not a vertex is called a true tangent ; all other tangent 
lines and planes are called pseudo-tangents. 


* It should be noticed that, on account of the relation ay = aj, Dayxiy; = Dayyiay. 
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EXERCISES 


1. Prove that if P isa point on a quadric surface S, which is not a vertex, 
and p the tangent plane at this point, one of the following three cases must occur : 

(a) Two, and only two, lines of p are rulings of S, and these rulings intersect 
at P. 

(6) One, and only one, line of p is a ruling of S, and this ruling passes 
through P. i 

(c) Every line of p is a ruling of S. 


2. Prove that 

(a) When case (a) of Exercise 1 occurs, the quadric surface is not a cone; 
and, conversely, if the quadric surface is not a cone, case (a) will always occur. 

(5) If case (6) of Exercise 1 occurs, p is tangent to S at every point of the 
ruling which lies in p. 

(c) If case (6) of Exercise 1 occurs, S is a cone with one, and only one, vertex, 
and this vertex is on the ruling which lies in p; and conversely, if S is a cone with 
one, and only one, vertex, case (>) will always occur. 

(d) If case (c) of Exercise 1 occurs, there is a line / in p every point of which 
(but no other point) is a vertex of S; and S consists of two planes one of which 
is p, while the other intersects it in /. 

& 
39. Conjugate Points and Polar Planes. Two points are com- 


monly said to be conjugate with regard to a quadric surface 
4 
(1) Lyi Li; > 0, 


when they are divided harmonically by the points where the line 
connecting them meets the surface. In order to include all limiting 
cases, we frame the definition as follows: 


DEFINITION. Two distinct points are said to be conjugate with re- 
gard to the surface (1) of 

(a) The line joining them is a tangent or a secant to (1), and the 
points are divided harmonically by the points where this line meets (1); or 

(b) The line joining them is a ruling of (1). 

Two coincident points are called conjugate if they both le on (1). 

Let the codrdinates of the points be (¥;, Yo: Yar Y4) ANA (2, 2g) gy Z4)s 
and let us look first at the case in which the points are distinct and 
neither of them lies on (1), and in which the line connecting them is 
a secant of (1). The points of intersection of the line yz with (1) 
may therefore be written 


(2 + AW yy Za + Adar 23 +i Yar % + a Ya) (= 1, 2), 
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where A, and A, are the roots of Equation (2), § 38. A necessary 
and sufficient condition for harmonic division is that the cross-ratio 
Ay / Aq have the value —1; that is\,+2,=0; or, referring back 
to Equation (2), § 38, 


4 
(2) Zig Ye; =0. 


We leave it for the reader to show that in all other cases in which y 
and z are conjugate this relation (2) is fulfilled ; and that, conversely, 
whenever this condition is fulfilled, the points are conjugate. Thatis: 


THEOREM 1. A necessary and sufficient condition that the points 
y, @ be conjugate with regard to (1) is that (2) be fulfilled. 


This theorem enables us at once to write down the equation of 
the locus of the point 2 conjugate to a fixed point y, namely, 


4 
(3) Zaye; =0. 


Except when the first member of this equation vanishes identically, 
this locus is therefore a plane called the polar plane of the point y. 
We saw in the last section that the first member of (3) vanishes 
identically when (1) is a cone and y isa vertex. This is the only 
case in which it vanishes identically ; for, if y is any point, not a 
vertex, on a quadric surface, (8) represents the tangent plane at that 
point; while if y is not on (1), the first member of (3) can clearly not 
vanish identically, since it does not vanish when the z’s are replaced 
by the y’s. Hence the theorem : 


~TuHeEoreM 2. Jf (1) ts not a cone, every point y has a definite polar 
plane (3); if (1) is a cone, every point except its vertices has a definite 
polar plane (3), while for the vertices the first member of (3) is identi- 
cally zero. 


We note that the property that a plane is the polar of a given 
point with regard to a quadric surface is a projective property, since 
a collineation of space evidently carries over two conjugate points 
into points conjugate with regard to the transformed surface. 


THEOREM 38. Jf two points P, and P, are so situated that the 
polar plane of P, passes through P., then, conversely, the polar plane 
of P, will pass through P,. 
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For, from the hypothesis, it follows that P, and P, are conjugate 
points, and from this the conclusion follows. 


40. Classification of Quadric Surfaces by Means of their Rank. 
Theorem 2 of the last section may be stated by saying that a neces- 
sary and sufficient condition that the quadric surface 


( 1) = ; : UjyjzX iL; = 0 


be a cone and that (y, y2, Ys, ys) be its vertex (or one of its vertices) 
is that 


4 
(2) Laytiy; =0. 


This identity (2) is equivalent to the four equations 


AY 1 + AyYo +343 + AY, = 9, 
(3) oY + Aa9Yo + Ag3/3 + AogY4 = 0, 
A311 + UgoYo + AssYg t+ AssYy = 9, 
By YF MY + Ay3Ygt AyyYy = 9- 


A necessary and sufficient condition for this set of equations to 
have a common solution other than (0, 0, 0, 0) is that the determi- 
nant of their coefficients be zero. We notice that this determinant 
is the discriminant a of the quadric surface. Hence, 


THEOREM 1. A necessary and sufficient condition for a quadric 
surface to be a cone is that its discriminant vanish, 


Tf, then, the rank of the quadric surface is four, the surface is not 
a cone. 

If the rank is three, the set of equations (8) has one, and, except 
for multiples of this, only one, solution. Hence in this case the sur- 
face is an ordinary cone with a single vertex. 

If the rank is two, equations (3) have two linearly independent solu- 
tions (cf. §18), on which all other solutions are linearly dependent. 
Hence in this case the surface is a cone with a whole line of vertices. 

If the rank is one, equations (8) have three linearly independent 
solutions on which all other solutions are linearly dependent. Hence 
we have a cone with a whole plane of vertices. 

If the rank is zero we have, strictly speaking, no quadric surface ; 
but the locus of (1) may be regarded as a cone, every point in space 
being a vertex. 
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It is clear that the property of a quadric surface being a cone is 
a projective property ; and the same is true of the property of a point 
being a vertex of a cone. Hence from the classification we have 
just given we infer 


THEOREM 2. The rank of a quadric surface is unchanged by non- 
singular collineations. 


EXERCISES 


1. Derrnirion. Jf a plane p is the polar of a point P with regard to a quadric 
surface, then P is called a pole of p. 

Prove that if the quadric surface is non-singular, every plane has one, and only 
one, pole. 


2. Prove that if the quadric surface is a cone, a plane which does not pass 
through a vertex has no pole. 
What can be said here about planes which do pass through a werten? 


41. Reduction of the Equation of a Quadric Surface to a Normal 
Form. Since cross-ratio is invariant under a non-singular collinea- 
tion, a quadric surface S, a point P, not on S, and its polar plane 
with regard to S, are carried over by any non-singular collineation 
into a quadric surface S’, a point P’, and its polar plane rae regard 


to S’. A point (Y1, Yo, Ys Ys) not 2 the quadri ic surface Sao; ==, 
cannot be on its own polar plane Sa, jiYj = 0 as we see by replacing 


the 2’s in this last equation by the y’s. Now transform by a colline- 
ation so that this point becomes the origin and its polar plane the 
plane at infinity.* The quadric surface will now be a central quad- 
ric with center at the origin, since, if any line be drawn through the 
origin, the two points in which this line meets the surface are divided 
harmonically by the origin and the point at infinity on this line. 

The equation of the polar plane of the point (yj, yj, y§, y{) with 
regard to the transformed quadric 


ee, 
Laie; = 0 
5 é Ui aabagl 
is ZMighiY ==\0, 


* Such a collineation can obviously be determined in an infinite number of ways 
by means of the theorem that there exists a collineation which carries over any five 
linearly independent points into any five linearly independent points; cf. Exercises 2, 
8, § 24. 
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which reduces to the simple form 
Uy, + AyX, + Ayers + Ayr{ = 0 
when the point is the origin (0, 0, 0,1). For this equation to rep- 
resent the plane at infinity, we must have 
Why = Ay = A, = 0, ah, #0. 
Hence the quadric surface becomes 
Gy TP + Ay B+ ay 2} 2 
yy Wy Uy + yy UP + hy Wy US 
+ gy %3 2 + Ayy Wy X + yy UP 
. + al, 2/2 =0. 

A slightly different reduction can be performed by transforming 
the point (41, Ya, Y3 ¥4) to the point at infinity on the 2,-axis and its 
polar plane to the 2,7,-plane. It is easy to see that we thus get rid 
of the terms containing x, except the square term. 

Similarly we can get rid of the terms containing x, and 25. 
Thus we see that any quadric surface can be reduced by a collineation 
to a form where its equation contains no term in x, except the term in 
x? whose coefficient then ts not zero. . 

According as we take for 7 the values 1, 2, 3, 4, we get thus four 
different normal forms for the equation of our quadric surface, and 
inasmuch as each of these forms can be obtained in a great variety 
of ways, the question naturally arises whether we cannot perform 
all four reductions simultaneously. That this can, in general, be 
done may be seen as follows: let y be a point not on the quadric 
surface, and zg any point on the polar plane of y, but not on the 
quadric surface. Its polar plane contains y. Let w be any point on 
the intersection of the polar planes of y and zg, but not on the quadric 
surface. Then its polar plane passes through yand z. ‘These three 
polar planes meet in some point w, and it is readily seen that the four 
points y,z,w,u do not lie onaplane. The tetrahedron yzwu is called 
a polar or self-conjugate tetrahedron of the quadric surface, since it 
has the property that any vertex is the pole of the opposite face. 

If we transform the four points, y, z, w, w to the origin and the 
points at infinity on the three axes, the effect will be the same as that 
of the separate transformations above, that is, the equation of the 
quadric surface will be reduced to the form 


! 12 ! /12, ! 12, ! ee 
Ay TP + Ag, LP + Ags TH+ Ay VZ= 0. 
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We have tacitly assumed that it is possible to find points y, z, w, 
constructed as indicated above, and not lying on the quadric surface. 
We leave it for the reader to show that, if the quadric surface is not 
a cone, this will always be possible in an infinite number of ways. 
A cone, however, has no self-conjugate tetrahedron, and in this case 
the above reduction is impossible. 


EXERCISES 


1. Prove that if the discriminant of a quadric surface is zero, the equation of 
the surface can always be reduced, by a suitable collineation, to a form in which 
the codrdinate x, does not enter. 

[Suecxrstion. Show, by using the results of this chapter, that if the vertex of a 
quadric cone is at the origin, @14 = d24 = 34 = dag = 0.] 


2. Show that, provided the cone has a finite vertex, the collineation of 
Exercise 1 may be taken in the form 


ace 
LH M+arly 


ne 
t= 2+ Bry 
(aes 
%g=%etyXy 
r= 24. 


[Sucexstion. Use non-homogeneous codrdinates. | 


CHAPTER X 
QUADRATIC FORMS 


42. The General Quadratic Form and its Polar. The general 
quadratic form in x variables is 


n 
= 2 
(1) Daya; SAT AA y9% Tot... + Ayn Ly, 


HF Ay X04 + Any} H+ 00. + An ®y 


Hb Any Ln®y + AngVnLe + +00 + AnnZ?, 


where a, =a,;.* The bilinear form Saye; is called the polar form of 
(1). Subjecting (1) to the linear transformation 


en, ’ 
Ly = Cy® eee + Cp Fry 
aepeea] ! 
Ly = CX + -0> + Canny 
we get a new quadratic form 
n 
2 Dalziel. 
( ) 1 ett at | 


The polar form of (2) is Zajyjz;. If we transform the y’s and 2’s of 
the polar form of (1) by the same transformation c, we get a new 
bilinear form Layy iz). We will now prove that a, = aj,. 

We have the identities 


n n 
me [ole | 
(3) Layne = LUjLikjy 


(4) Bdiyy A; = LAs yey. 


* Tt should be clearly understood that this restriction is a matter of convenience, 
not of necessity. If it were not made, the quadratic form would be neither more nor 
less general. 
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Each of these we may regard as identities in the a!’s, ys, 2’’s, the 
2's, y’s, 2’8 being merely abbreviations for certain polynomials in the 
corresponding primed letters. The last written identity reduces, 
when we let yf =z! =a) (t=1, 2,...%), to 


n n 
ry ee 


Combining this with (8) gives 
on eal : Th nal! 
DM iguih; = DMV. 
a, if 


se _ pS, 5] l 
Hence Cy = Ui and ay af a= dig Dis 


We have assumed that aj,=a;, these being merely the coefficients of 


a certain quadratic form, and we proved, in Theorem 4, § 36, that 
a, = 4;, Hence we infer that a, = a};. 
From this fact and from (4) we get at once the further result : 


n n 

DA 
Ly i2j = DUgY 2). 
That is : 


THEOREM. The polar form 
Sayy 2; 
is an absolute covariant of the system composed of the quadratic form 
Sayre; 
and the two points CYiy es Wink (2am a2, 


43. The Matrix and the Discriminant of a Quadratic Form. 


DEFINITION. The matrix 


Bj tm Org 


TARA 
is called the matrix of the quadratic form 
n 


The determinant of a is called the discriminant of (1); and the rank of 
a, the rank of (1). If the discriminant vanishes, (1) is called singular. 
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The matrix of (1) is the matrix of its polar form. Moreover, as 
was shown in the last section, if the 2’s in (1) are subjected to a 
linear transformation, and the y’s and 2’s in the polar of (1) are sub- 
jected to the same transformation, the matrix of the new quadratic 
form will be the same as the matrix of the new bilinear form. But 
we saw, in Theorem 1, $36, how the matrix of a bilinear form is 
changed by linear transformations of the variables. Thus we have 
the theorem : 


THEorEM 1. Jf in the quadratic form (1) with matrix a we sub- 
ject the xs to a linear transformation with matrix c, we obtain a new 
quadratic form with matrix c'ac, where c' is the conjugate of c. 


From this there follow at once, precisely as in §36, the further 
results : ; 

THEOREM 2. The rank of a quadratie form is not changed by non- 
singular linear transformation. 


THEOREM 3. The discriminant of a quadratic form is a relative 
invariant of weight two. 


44. Vertices of Quadratic Forms. 
DEFINITION. By a vertex of the quadratic form 


. n 
el ) BARE 
we understand a point (c4, +++ C,) where the es are not all zero, such that 
n 
(2) Laie; == Q. 


A quadratic form clearly vanishes at all of its vertices. 
It is merely another way of stating this definition when we say : 


THrorEeM 1. <A necessary and sufficient condition that (¢4, +++ ¢,) be 
a vertex of (1) is that it be a solution, not consisting exclusively of zeros, 
of the system of equations 


Ay10y Hv + Aynln =9, 
(3) 


Any €y + -++ + Annln = 9. 
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Since the resultant of (3) is the discriminant of (1), we may add: 


THEOREM 2. A necessary and sufficient condition for a quadratic 
form to have a vertex is that its discriminant be zero; and if the rank 
of the form is r, it has n —r linearly independent vertices, and every point 
linearly dependent on these is a vertex. 

In particular, we note that if the discriminant of a quadratic 
form is zero and if the cofactors of the elements of this determinant 
are denoted in the ordinary way by A,, then (Aj, --- A;,) is a vertex, 
provided all these A’s are not zero. 

The following identity is of great importance (cf. formula 


(2), § 88), 
(4) Say (+ AY) (4 +AY;) = Saye, +2 rZay 2iYj + NS ay Yi 
This may be regarded as an identity. in all the letters involved. 

If (¢,, +++ ¢,) is a vertex of the quadratic form Say ,v;, and these 
e's are substituted in (4) in place of the y’s, the ae two terms of 
the second member of this identity are zero, and we have 


(5) Say (2; + de;) (2; + re; ) => Uj (2; 3 


and conversely, if (5) holds, (¢,, -- * én) is a vertex; for subtracting 
(5) from (4), after substituting the ¢’s for the ys in res we have 


2rZay ;2iCj +? Say 6G; 


ji 

and, this being an identity in ) as well as in the z’s, we have 
n 
LVAyZ,C; = 0. 
1 


Thus we have proved the following theorem: 


THEOREM 3. A necessary and sufficient condition that (¢,, ++ ¢,) 
be a vertex of the quadratic form (1) is that 2, ---z, and X being in- 
dependent variables, the identity (5) be fulfilled. 


EXERCISES 


1. Prove that if (¢,,...c,) is a vertex of (1), and (y,,...yn) is any point at 
which the quadratic form vanishes, then the quadratic form vanishes at every 
point linearly dependent on c and y. 


2. State and prove a converse to 1. 
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45. Reduction of a Quadratic Form to a Sum of Squares. If in 
the quadratic form 


(1) é f (ay +++ %,) = 2Ayj Xi; 


the coefficient a, is not zero, we may simplify the form by the fol- 
lowing transformation due to Lagrange. 
The difference 


¢ (ry a, (a apes Ain®n)” 
is evidently independent of z;. Denoting it by ¢,, we have 
1 
$ a Pace! Leone CA 1 py: 


If, then, we perform the non-singular linear transformation 
Ui 
Ly FH Ay, Fo Uig%, + +++ +Ain®y 
oe 
oe tga X 


ete 


eae 
Ln a : Lny 


the quadratic form ¢ is reduced to the form 
1 
@) ae 1 (@y ++ M)s 


in which all the terms in x are wanting except the term in 7. 

It will be seen that this reduction can in general be performed in 
a variety of ways. It becomes impossible only when the coefficients 
of all the square terms in the original quadratic form are zero. 

Unless, in the new quadratic form ¢,, the coefficients of all the 
square terms are zero, we can apply the same reduction to this form 
by subjecting the variables z}, --- 2, to a suitable non-singular linear 
transformation. This transformation may also be regarded as a non- 
singular linear transformation of all the w'’s: (a1, vj, --- v),) if we write 
vi =2z}. We thus reduce (3) to the form 


(4) © al + al + dy (hy 2%), 
i ay 

Applying this reduction now to ¢,, and proceeding as before, we 
see that by a number of successive non-singular transformations the 


form ¢ can finally be reduced to the form: 
(5) caf + ey + + + oy 2%. 
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These successive transformations can now be combined into a single 
non-singular linear transformation, and we are thus led to the 


THEOREM. very quadratic form in n variables can be reduced to 
the form (5) by a non-singular linear transformation. 

The proof of this theorem is not yet complete; for if at any 
stage of the reduction the quadratic form ¢; has the peculiarity that 
all its square terms are wanting, the next step in the reduction will be 
impossible by the method we have used. Before considering this point, 
we will illustrate the method of reduction by a numerical case. 

Example. 

222+ 24%,+82,2% 
P=i+ %%,— 34 +9 mr) = 322, +4, +843)? + dy 


+82%,%,+9 x32, + 2 22 
where ; : ; ; —ia2 + 5ayxe, 
1= —4(a, + 8 2x3) — 8434+ 18 2,.%,+ 2 = +5 2,2, — 3022 


= 2__ 160 
= —#(— $4, +5 2,)?— 160 23. 


Accordingly, by means of the non-singular linear transformation 
a =20,+2, +8425, 
Uy = —$%,+ 52, 
3 = 3p 
the form ¢ reduces to 3}a?—2xe2—1$%22. 
We have given here merely one method of reduction. Three differ- 
ent methods were open to us at the first step and two at the second. 
We proceed now to complete the proof of the general theorem. 
Let us suppose that the coefficients of all the square terms in ¢ are 
zero,* but that a,,40. Then 
Py 01° Ly) 2 yyy + 2 Wy (Ayg%y + +++ + Ayyp) 
+ 2 2 ( Gog%— + +++ + AanLn + Saxe; 
3 
2 
= 5 kaa H+ yyy + ++ + Ayn ®n) (Ay Py + Aggy + +++ + Ayn Ly) 
12 
+ $4, 


9 n 
where $,=-— 7 kas + s+ + Ayn %n) (Agg%g + +++ + Ayn %y) + La yt y- 
12 


* This method may be used whenever aj;=a22=0 whether all the other coef- 
ficients aj; are zero or not. 
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The non-singular linear transformation 


re 
/ 
Ly = Agi Ty H Agg%s + +++ + Aon Ly 
v3 = L3 
fs lage. 
= Bije 


thus reduces ¢ to the form 

2 
— &1 5 + Py(Lgy ++ Ly). 
ee 


The further non-singular transformation 
his ell i! 
T= %4+2%, 
{| ae ! 
tue, 
== ld 
2 es 


ch m4 


reduces ¢ to the form 
Tee eo 
2 Gyn 2a 





2? + Py(U3y ++ Le 
12 


The above reduction was performed on the supposition that a,+0. 
It is clear, however, that only a slight change in notation would be 
necessary to carry through a similar reduction if a,,=0 but a,+ 0. 
The only case to which the reduction does not apply is, therefore, 
the one in which all the coefficients of the quadratic form are zero, 
a case in which no further reduction is necessary or possible. 

We thus see that whenever Lagrange’s reduction fails, the method 
last explained will apply, and thus our theorem is completely es- 
tablished. 

EXERCISES 

1, Given a quadratic form in which n= 5 anda; =|i—j|. Reduce to the 
form (5). 

2. Reduce the quadratic form 

922-6 y? —8 224 bry — 14224 18 2w +8 yz +12 yw —4 ew 
to the form (5). 
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3. Prove that if (y,,... yn) is any point at which a given quadratic form is 
not zero, a linear transformation can be found (and that in an infinite number of 
ways) which carries this point into the point (0,--. 0, 1) and its polar into kz,; 
and show that this linear transformation eliminates from the quadratic form all 
terms in x, except the term in x2 which then has a coefficient not zero. 


4. Prove that the transformations described in Exercise 3 are the only ones 
which have the effect there described. 


5. Show how the two methods of reduction explained in this section come as 
special cases under the transformation of Exercise 3. 


46. A Normal Form, and the Equivalence of Quadratic Forms. 
In the method of reduction explained in the last section, it may 
happen that, after we have taken a number of steps, and thus 
reduced ¢ to the form 


Cy tov + Gre + hy (Lesas *** Ln)s 


the form ¢, is identically zero. In this case no further reduction 
would be necessary and the form (5) of the last section to which ¢ is 
reduced would have the peculiarity that ¢&,,=¢..= -- =¢,=0, 
while all the earlier e’s are different from zero. It is easy to see just 
when this case will occur. 

For this purpose, consider the matrix 


te ear 
(i ia, se 0 
yy eee 


nN 


of the reduced form (5) of § 45. It is clear that the rank of this 
matrix is precisely equal to the number of e’s different from zero ; 
and, since the rank of this reduced form is the same as that of the 
original form, we have the result : 


THEOREM 1. A necessary and sufficient condition that it be possi- 
ble to reduce a quadratic form by means of a non-singular linear trans- 
formation to the form 


(1) ett om + 608 


where none of the c’s are zero, is that the rank of the quadratic form be r. 
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This form (1) involves r coefficients ¢,, --: ¢,. That the values of 
these coefficients, apart from the fact that none of them are zero, are 
immaterial will be seen if we consider the effect on (1) of the trans- 


formation ' a 
i 
= afH2!, 
Cy 
(2) Lt, = 7) 


where &,, --- &, are arbitrarily given constants none of which, how- 
ever, is to be zero. The transformation (2) is non-singular, and 
reduces (1) to the form 
(3) kywe+ + + k,al2. 
Thus we have proved 

THEOREM 2. <A quadratic form of rank r can be reduced by means 
of a non-singular linear transformation to the form (3), where the values 


of the constants k,, --- k, may be assigned at pleasure provided none of 
them are zero. 


If, in particular, we assign to all the k’s the value 1, we get 


THEOREM 8. very quadratic form of rank r can be reduced to 
the normal form 
(4) Bt +2 


by means of a non-singular linear transformation. 


From this follows 

THEOREM 4. A necessary and sufficient condition that two quad- 
ratie forms be equivalent with regard to non-singular linear trans- 
formations is that they have the same rank. 

That this is a necessary condition is evident from the fact that the 
rank is an invariant. That it is a sufficient condition follows from 
the fact that, if the ranks are the same, both forms can be reduced 
to the same normal form (4). 


X 
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The normal form (4) has no special advantage, except its sym- 
metry, over any other form which could be obtained from (8) by 
assigning to the #’s particular numerical values. Thus, for instance, 
a normal form which might be used in place of (4) is 


wet + a2, — 2. 


This form would have the advantage, in geometrical work, of giving 
rise to a real locus. 

Finally we note that the transformations used in this section are 
not necessarily real, even though the form we start with be real. 


EXERCISE 
Apply the results of this section to the study of quadric surfaces. 


47. Reducibility. A quadratic form is called reducible when it is 
identically equal to the product of two linear forms, that is, when 


(1) ~ OU Xj = (Dyxy + dy wg + +++ + .b,0_)(Cy@y H Cylq + +++ + Cp): 


Let us seek a necessary and sufficient condition that this be the case. 
We begin by supposing the identity (1) to hold, and we consider in 
succession the case in which the two factors in the right-hand mem- 
ber of (1) are linearly independent, and that in which they are pro- 
portional. In the first case the 6’s are not all proportional to the 
corresponding e’s, and by a mere change of notation we may insure 
6,, 6, not being proportional to ¢,, ¢. This being done, the trans- 
formation 

a = b,2, + boty + +++ +b,2, 
Lh = C12 + Cy%y ++ +62, 


ie 
0, Lg. 


a, = Ly 


is non-singular and carries our quadratic form over into the form 
! pl 
rns 


The matrix of this form is readily seen to be of rank 2, hence the 
original form was of rank 2. 
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Turning now to the case in which the two factors in (1) are 
proportional to each other, we see that (1) may be written 


~ Aviv; = O(byry + +++ + bya) where (+0. 


Unless all the 6’s are zero (in which case the rank of the quadratic 
form is zero) we may without loss of generality suppose 6,0, in 
which case the linear transformation 


Aes 
w= b,a,+ as + b,Un 
iy 
43> Ly 


_— 
Ly, = Xn, 


will be non-singular and will reduce the quadratic form to 
pp 
which is of rank 1. Cn, 
Thus we have shown that if a quadratic form is reducible, its 
rank is 0, 1, or 2. We wish now, conversely, to prove that every 
quadratic form whose rank has one of these values is reducible. 
A quadratic form of rank zero is obviously reducible. 
A form of rank 1 can be reduced by a non-singular linear trans- 
formation to the form zis that is, 


> Ug VX; = Ly. 


If here we substitute for z, its value in terms of the 2’s, it is clear 
that the form is reducible. 
A form of rank 2 can be reduced to the form 2!” +2, that is, 


dryeae +a) =e + Vb al)(a =v 1a). 
Here again, replacing x} and x, by their values in terms of the 2’s, 
the reducibility of the form follows. Hence, 


THEOREM. A necessary and sufficient condition that a quadratic 
form be reducible is that its rank be not greater than 2. 


48. Integral Rational Invariants of a Quadratic Form. We have 
seen that the discriminant a of a quadratic form is an invariant of 
weight 2. Any integral power of a, or more generally, any constant 
multiple of such a power, will therefore also be an invariant. We 
will now prove conversely the 

THEOREM. LHvery integral rational invariant of a quadratic form 
is a constant multiple of some power of the discriminant. 
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Let us begin by assuming that the quadratic form 
(1) Lay UX; 
is non-singular, and let ¢ be the determinant of a linear transforma- 
tion which carries it over into the normal form 
(2) ae ab te +--+ +2/2. 
Let I (a4, +++ Gn) be any integral rational invariant of (1) of weight p, 
and denote by & the value of this invariant when formed from (2). 


It is clear that & is a constant, that is, independent of the coefficients 
a; of(1). Then yap 


Moreover, the discriminant a being of weight 2, and having for (2) 


the value 1, we have ees 


Raising the last two equations to the powers 2 and yw respectively, 


we get Bal, 1a Par. 
From which follows 
(3) [2 = kPa". 


This formula has been established so far for all values of the 
coefficients ay for which a#0. ‘That it is really an identity in the 
as is seen at once by a reference to Theorem 5,§2. The poly- 
nomial on the right-hand side of (3) is of degree mw in a,,;* hence 
we see that # must be an even number, since J? is of even degree in 
ay. Letting » = 2», we infer from (3) (cf. Exercise 1, § 2) that one 
or the other of the identities 

I= ka’, I=— ka’ 


must hold, and either of these identities establishes our theorem. 

A comparison of the result of this section with Theorem 4, § 46 
will bring out clearly the essential difference between the two con- 
ceptions of a complete system of invariants mentioned in § 29. It will 
be seen that the rank of a quadratic form is in itself a complete sys- 
tem of invariants for this form in the sense of Definition 2, § 29; 
while the discriminant of the form is in itself a complete system in 
the sense of the footnote appended to this definition. 


* We assume here that k=£0, as otherwise the truth of the theorem would be 
obvious. 
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49. A Second Method of Reducing a Quadratic Form to a Sum of 
Squares. By the side of Lagrange’s method of reducing a quadratic 
form to a sum of squares, there are many other methods of accom- 
plishing the same result, one of the most useful of which we pro- 
ceed to explain. It depends on the following three theorems. The 
proof of the first of these theorems is due to Kronecker and estab- 
lishes, in a remarkably ‘simple manner, the fact that any quadratic 
form of rank r can be written in terms of r variables only, a fact 
which has already been proved by another method in Theorem 1, § 46. 


THEOREM 1. Jf the rank of the quadratic form 
Q) Bee ~-%) = Saye 


is r > 0, and tf the variables x,, --- x, are so numbered that the r-rowed 
determinant in the upper left-hand corner of its matrix is not zero,* 
new variables x}, ---x, can be introduced by means of a non-singular 
linear transformation such that 


= 7; G=r+l, - N), 
and such that (1) reduces to the form 
Lay x a, 


This, it will be noticed, is a quadratic form in r variables in which 
the coefficients, so far as they go, are the same as in the given form (1). 

In order to prove this theorem, we begin by finding a vertex 
(Cy, -*- &,) of the form (1) by means of Equations (3), § 44. Since the 
r-rowed determinant which stands in the upper left-hand corner of 
the matrix of these equations is not zero, the values of ¢,,;, --- ¢, may 
be chosen at pleasure, and the other ¢’s are then completely deter- 
mined. If we let ¢1, = ¢p4g= °** = ¢n_1=9, ¢, =1, we get a vertex 


(onc, 04 0.01), 
Using this vertex in the identity (5), § 44, we have 
f(y + Alyy 1+ Vp AC py Lpzys *** Lays Un +) = H(@yy +++ La) 
If we let X= —z,, this identity reduces to 
fp (ay — Cy Bns °** Lp — Cp Lng Ly y9 °° En 0) =H (Wy *** Lp) 


* That such an arrangement is possible is evident from Theorem 3, § 20. 
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Accordingly, if we perform the non-singular linear transformation * 
ewes (¢= 1-77), 


C= 2; (¢=r+1, ---n), 


the quadratic form (1) reduces to 
n-1 
(24, es Dieu 0) = Lay ity 


This, being a quadratic form in n—1 variables of rank r and so 
arranged that the r-rowed determinant which stands in the upper 
left-hand corner of its matrix is not zero, can be reduced, by the 
method just explained, to the form 


n= 2 
Says a} 
where the linear transformation used is non-singular and such that 
es (¢=r+1,---n—1). 
By adding the formula z)/=di, 
we may regard this as a non-singular linear transformation in the n 


variables. ‘This transformation may then be combined with the one 
previously used, thus giving a non-singular transformation in which 


al! = 2, (¢=r+l1,---n), 
and such that it reduces (1) to the form 


Se all 
BAX; Lj 


Proceeding in this way step by step, our theorem is at last 
proved. 

In the next two theorems we denote by A;; in the usual way the 
cofactor of a, in the discriminant a of the quadratic form (1). 


THEOREM 2. If A,,+90, new variables x\, --- a! can be introduced 
by a non-singular transformation in such a way that 


Un = Lp 


and that (1) takes the form 


ane ee rn ea 
2a; uv; a; + | Un . 
1 nn 


* This transformation should be compared with Exercise 2, § 41. 
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To prove this we consider the quadratic form 
a n a 

Pp Ae 

Dd U{Xj Le 


n 
Ais 


Its discriminant is 








yy see A n-1 An 
a 
=a —Ann, = 0. 
G-1,1 “* F-1,n-1 UM-1yn i ie 
Any ace An, n-1 Ann — A 


nn 


Hence by means of a non-singular transformation of the kind used 
in the last theorem, an essential point being that z/ = x,, we get 








n a ‘ n-1 et 
J Ay X;X; — Ue = Vay UL, 
nn 
or 4 el (Pat) a 
= i 
= Uy LX; = VAyU, a, + aA. 
1 : nn 


THEOREM 38. Jf 
Aaj — ya — 0, Ane x 0, 


new variables x}, --- x, can be introduced by a non-singular transforma- 
tion in such a way that ; i 

Ly—1 = Ln—p n= Ln) 
and that (1) takes the form 


Dae ys 
CD ae 





— el 
= Ay t,x; + 
nN, n-1 


Let us denote by’ B the determinant obtained by striking out the 
last two rows and columns of a. ‘Then (cf. Corollary 8, § 11) we 





have vo. Vereen 

(2) aB= | ve as Ae (me — AZ, 0. 
Consider, now, the quadratic form 

(3) Sayer, — es LnLn—1 


n,n=1 
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Its discriminant is 











Ay MN, n-1 An 
4 a 
( ) A—1,1 °° %n-1, n-1 Bn-1,n — A 
n,n-1 
e a 
Oe Anne Teme mn 
: 7 1 
a a a 
= — Ay, nq ~ An, —B \s 
n, n—1 Nn, n-1 n,n-1 


which has the value zero, as we see by making use of (2). Not only 
does the determinant (4) vanish, but its principal minors obtained 
by striking out its last row and column and its next to the last row 
and column are zero, being A,,, and A,_,,_, respectively. The minor 
obtained by striking out the last two rows and columns from (4) is 
B, and, by (2), this is not zero. Thus we see (cf. Theorem 1, § 20) 
that the determinant (4) is of rank n—2. Hence, by Theorem 1, 
we can reduce (3) by a non-singular linear transformation in which 
ef = Tp 1; t= 2, to the form 





2a pee ale 
2a, jUiLy — —— 2p,Lp_ = LMjjUij- 
n,n-] 
< (edi 2a bear 
Hence 2a LL; = Sa lity + eae 
Ny RI 


CoRoLLARY. Under the conditions of Theorem 3, the quadratic 
form (1) can be reduced to the fs 





n-2 i 
Pg pay + (a yan fa 


nae 


n,n-1 


by a non-singular linear transformation. 


To see this we have merely first to perform the reduction of 
Theorem 3, and then to follow this by the additional non-singular 
transformation 


al = a! (i=1, 2,--n— 2), 


U " 
Ly C= n- 1th, 


whom a, all 
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Having thus established these three theorems, the method of 
reducing a quadratic form completely is obvious. If the form (1) is 
singular, we begin by reducing it by Theorem 1 to 


* 
~ Ug Ui; L5y 


where r is the rank of the form. Unless all the principal (r—1)- 
rowed minors of the discriminant of this form are zero, the order 
of the variables 2,,---z, can be so arranged that the reduction of 
Theorem 2 is possible, a reduction which may be regarded as a non- 
singular linear transformation of all n variables. If all the princi- 
pal (xr —1)-rowed minors are zero, there will be at least one of the 
cofactors A;; which is not zero, and, by a suitable rearrangement of 
the order of the variables, this may be taken as A,,_,. The reduc- 
tion of Theorem 3, Corollary, will then be possible. Proceeding in 
this way, we finally reach the result, precisely as in Theorem 1, § 46, 
that a quadratic form of rank r can always be reduced by a non- 
singular linear transformation to the form 


Cat + ++ + 6,22. 


It may be noticed that the arrangement of the transformation of 
this section is in a certain sense precisely the reverse of that of § 45, 
inasmuch as we here leave at each step the coefficients of the unre- 
duced part of the form unchanged, but change the variables which 
enter into this part; while in §45 we change the coefficients of the 
unreduced part, but leave the variables in it unchanged. 


CHAPTER XI 
REAL QUADRATIC FORMS 


50. The Law of Inertia. We come now to the study of real 
quadratic forms and the effect produced on them by real linear 
transformations. 

We notice, here, to begin with, that the only operations involved 
in the last chapter are rational operations (7.e. addition, subtraction, 
multiplication, and division) with the single exception of the radicals 
which come into formula (2), §46. In particular the reduction of 
§ 45 (or the alternative reduction of § 49) involves only rational oper- 
ations. Consequently, since rational operations performed on real 
quantities give real results, we have 


THEOREM 1. A real quadratic form of rank r can be reduced by 
means of a real non-singular linear transformation to the form 


12, 12 I 
(1) CP + cyte + --- + 6a? 
where cy, +++ ¢, are real constants none of which are zero. 


As we saw in the last chapter, this reduction can be performed ina 
variety of ways, and the values of the coefficients c,, --- c, in the reduced 
form will be different for the different reductions. The signs of these 
coefficients, apart from the order in which they occur, will not depend 
on the particular reduction used, as is stated in the following im- 
portant theorem discovered independently by Jacobi and Sylvester 
and called by the latter the Law of Inertia of Quadratic Forms: 


THEOREM 2. If areal quadratic form of rank r is reduced by two 
real non-singular linear transformations to the forms (1) and 
(2) Feyary? + hye? + oe + heyy, 
respectively, then the number of positive es in (1) is equal to the number 
of positive k’s in (2). 

In order to prove this, let us suppose that the a’’s and 2'!’s have 
been so numbered that the first w of the ec’s and the first v of the k’s 


are positive while all the remaining ¢’s and k’s are negative. Our 
: 144 
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theorem will be established if we can show that w= v. If this is 
not the case, one of the two integers uw and v must be the greater, 
and it is merely a matter of notation to assume that w> v. We will 
prove that this assumption leads to a contradiction. 

If we regard the z’’s and z'’’s simply as abbreviations for certain 
linear forms in the 2’s, (1) and (2) are both of them identically equal 
to the original quadratic’ form, and hence to each other. This iden- 
tity may be written | 


(3) qef t+ + +¢,02—|e, , 2.1 — + —le le? 


shal? + + + heya! — |b, 4 |! 1 — + —|egll2. 


Let us now consider the system of homogeneous linear equations 
an (2, --- Zpn)s 
" ! U / 
(4) al = 0, --- 2) = 0, 2, 4,= 0, --- 2, = 0. 


We have here v+n—p<n equations. Hence, by Theorem 3, 
Corollary 1, §17, we can find a solution of these equations in which 
all the unknowns are not zero. Let (y,, --- y,) be such a solution and 
denote by y}, y/ the values of 2}, x when the constants y,, --- y, are 
substituted in them for the variables ,,.--z,. Substituting the y’s 
for the 2’s in (3) gives 


eyft --- +ey2= —|k,.\y/24— -- —[F-lyy?. 


The expression on the left cannot be negative, and that on the right 
cannot be positive, hence they must both be zero; and this is pos- 
sible only if 

- y= =H =O. 


But by (4) we also have yi4,= °° =Y,= 


That is, (41, ---Y,) is a solution, not composed exclusively of zeros, of 
the system of m homogeneous linear equations in m unknowns, 


pe pes Sih ae 
a= 0, 1 = 0,--- 2,= 0. 


The determinant of these equations must therefore be zero, that is, 
the linear transformation which carries over the 2’s into the vs must 
be a singular transformation. We are here led to a contradiction, 
and our theorem is proved. 

L 
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We can thus associate with every real quadratic form two in- 
tegers P and WN, namely, the number of positive and negative coeffi- 
cients respectively which we get when we reduce the form by any 
real non-singular linear transformation to the form (1). These two 
numbers are evidently arithmetical invariants of the quadratic form 
with regard to real non-singular linear transformations, since two 
real quadratic forms which can be transformed into one another by 
means of such a transformation can obviously be reduced to the same 
expression of form (1).* ; 

The two arithmetical invariants P and NV which we have thus 
arrived at, and the arithmetical invariant r which we had before, are 
not independent since we have the relation 


(5) P4+WNer. 


One of the invariants P and W is therefore superfluous and either 
might be dispensed with. It is found more convenient, however, 
to use neither P nor JN, but their difference, 


(oe s=P_WN, 


which is called the signature of the quadratic form. 


DeEFINITION. By the signature of a real quadratic form is under- 
stood the difference between the number of positive and the number of 
negative coefficients which we obtain when we reduce the form by any 
real non-singular linear transformation to the form (1). 


Since the integers P and WV used above were arithmetical invari- 
ants, their difference s will also be an arithmetical invariant. It 
should be noticed, however, that s is not necessarily a positive in- 
teger. We have thus proved 


THEOREM 3. The signature of a quadratic form is an arithmetical 
invariant with regard to real non-singular linear transformations. 


EXERCISES 


1. Prove that the rank r and the signature s of a quadratic form are either 
both even or both odd; and that ; 
—rss<r. 

2. Prove that any two integers r and s (r positive or zero) satisfying the con- 
ditions of Exercise 1 may be the rank and signature respectively of a quadratic 
form. 

* P is sometimes called the index of inertia of the quadratic form. 


i 


—" 
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3. Prove that a necessary and sufficient condition that a real quadratic form 
of rank r and signature s be factorable into two real linear factors is that 
either (pee 
or R= 2. sie 0s 


4. A quadratic form of rank r shall be said to be regularly arranged (cf. § 20, 
Theorem 4) if the «’s are so numbered that no two consecutive A’s are zero in the set 


@1) U9 


A a A =a A = 
Ct eS Vesa, 





fe Ae one 





and that A,=+0. Prove that if the form is real and any one of these A’s is zero, 
the two adjacent A’s have opposite signs. 


[Succxstion. In this exercise and the following ones, the work of § 49 should be 
consulted. ] 


5. Prove that the signature of a regularly arranged real quadratic form is 
equal to the number of permanences minus the number of variations of sign in the 
sequence of the A’s, if the A’s which are zero are counted as positive or as nega- 
tive at pleasure. 


6. Defining the expression sgn x (read signum x) by the equations 
sgnz= +1 x>0, 
ssn 0) C10; 


senz=—1 B20; 


show that the signature of a regularly arranged real quadratic form of rank r is 
sgn (A,4,) + sgn(A,4,) +--+ sgn (Ay-14,). 


51. Classification of Real Quadratic Forms. We saw in the last 
section that a real quadratic form has two invariants with regard to 
real non-singular linear transformations, — its rank and its signa- 
ture. The main result to be established in the present section 
(Theorem 2) is that these two invariants form a complete system. 

If in §46 the e’s and f’s are real, the transformation (2) will be 
real when, but only when, each ¢ has the same sign as the corre- 
sponding #. All that we can infer from the reasoning of that section 
now is, therefore, that if a real quadratic form of rank r can be 
reduced by a real non-singular linear transformation to the form 


Cat + + + 6,22, 


it can also be reduced by a real non-singular linear transformation to 
the form yah ++ + kya, 
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where the ’s are arbitrarily given real constants, not zero, subject 
to the condition that each k has the same sign as the corresponding e. 
Using the letters P and YW for the number of positive and negative e's 
respectively, the transformation can be so arranged that the first 
P es are positive, the last V negative. Accordingly the first P k’s 
can be taken as +1, the last Vas —1. From equations (5) and (6) 
of § 50, we see that P and WV may be expressed in terms of the rank 
and signature of the form by the formule 


; ey 
©) ee: 2: 





Thus we have the theorem : 


THEOREM 1. A real quadratic form of rank r and signature s can be 
reduced by a real non-singular linear transformation to the normal form 


where P is given by (1). 


We are now able to prove the fundamental theorem : 


THEOREM 2. A necessary and sufficient condition that two real 
quadratic forms be equivalent with regard to real non-singular linear 
transformations ts that they have the same rank and the same signature. 

That this is a necessary condition is evident from the invariance 
of rank and signature. That it is sufficient follows from the fact 
that if the two forms have the same rank and signature, they can 
both be reduced to the same normal form (2). 


DerFinition. All real quadratic forms, equivalent with regard to 
real non-singular linear transformations to a given form, and therefore to 
each other, are said to form a class.* 

Thus, for instance, since every real non-singular quadratic form 
in four variables can be reduced to one or the other of the five 


normal forms, 
We ne a 


af + a3 + 22 — 2}, 

(3) at + a2 — 22 — 22, 
ot — 2 — 02 — 23, 

— af = 2h — of — 2h 


* This term may be used in a similar manner whenever the conception of equiva- 
lence is involved. 
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we see that all such forms belong to one or the other of five classes 
characterized by the values 


eat? Ue 2 4, y= 4, 


If, however, as is the case in many problems in geometry, we are 
concerned not with quadratic forms, but with the equations obtained 
by equating these forms to zero, the number of classes to be distin- 
guished will be reduced by about one half, since two equations are 
the same if their first members differ merely in sign. 

Thus there are only three classes of non-singular quadric surfaces 
with real equations, whose normal forms are obtained by equating 
the first three of the forms (3) to zero. These equations written in 
non-homogeneous codrdinates are 


een oraz 1, 
X04 724.7% = 1, 
Xe V2 72 = 1. 


The first of these represents an imaginary sphere, the second a real 
sphere, and the third an unparted hyperboloid generated by the revolu- 
tion of a rectangular hyperbola about its conjugateaxis. It may readily 
be proved that this last surface may also be generated by the revolution 
of either of the lines Vereen 7, 


about the axis of Z. We may therefore say: 


THEOREM 38. There are three, and only three, classes of non-singular 
quadric surfaces with real equations. In the first the surfaces are imag- 
inary ; in the second real, but their rulings are imaginary ; in the third 
they are real, and the rulings through their real points are real.* 


This classification is complete from the point of view we have 
adopted of regarding quadric surfaces as equivalent if one can be 
transformed into the other by a real non-singular collineation. The 
more familiar classification does not adopt this projective view, but 
distinguishes in our second class between ellipsoids, biparted hyper- 
boloids, and elliptic paraboloids ; and in the third class between un- 
parted hyperboloids and hyperbolic paraboloids. 


*Tf, as here, we consider not real quadratic forms, but real homogeneous quadratic 
equations we must use, not s, but |s| as an invariant. In place of |s| we may use what is 
known as the characteristic of the quadratic form, that is the smaller of the two in- 
tegers P, N. This characteristic is simply 4(r — |s|). 
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EXERCISES 


1. Prove that there are 1(n + 1) (n + 2) classes of real quadratic forms in n 
variables. 


2. Give a complete classification of singular quadric surfaces with real equa- 
tions from the point of view of the present section. 


52. Definite and Indefinite Forms. 


Derinition. By an indefinite quadratic form is understood a real — 
quadratic form such that, when it is reduced to the normal-form (2), — 


$51, by a real non-singular linear transformation, both positive and neg- 
ative signs occur. All other real quadratic forms are called definite ;* 


and we distinguish between positive and negative definite forms accord- 


ing as the terms in the normal form are all positive or all negative. 


In other words, a real quadratic form of rank r and signature s 
is definite if s= +7, otherwise it is indefinite. t 

The names definite and indefinite have been given on account of 
the following fundamental property: 


THEOREM 1. An indefinite quadratic form is positive for some real 
values of the variables, negative for others. A positive definite form is 
positive or zero for all real values of the variables; a negative definite 
form, negative or zero. 


The part of this theorem which relates to definite forms follows 
directly from the definition. To prove the part concerning indefi- 
nite forms, suppose the form reduced by a real non-singular linear 
transformation to the normal form 


(1) oP bo bah aly — al 


Regarding the 2'’s as abbreviations for certain real linear forms in 
the 2’s, let us consider the system of n—P homogeneous linear equa- 
tions 

kets aes hs 
(2) Upp, = 0, fp4,=0, «>a, = 0. 


Since these equations are real, and their number is less than the - 


number of unknowns, they have a real solution not consisting 


* Some writers reserve the name definite for non-singular forms, and call the 
singular definite forms semidefinite. 

t Otherwise stated, the condition for a definite form is that the characteristic be 
zero. Cf. the footnote to Theorem 8, § 51. 
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exclusively of zeros. Let (y,, -:: y,) be such a solution. This 
solution cannot satisfy all the equations 


(3) Diese. vp e= (), 


for equations (2) and (3) together form a system of n homogeneous 
linear equations in n unknowns whose determinant is not zero, since 
it is the determinant of the linear transformation which reduces the 
given quadratic form to the normal form (1). Accordingly, if we 
substitute (y,, --- y,) for the variables (z,, --- z,) in the given quad- 
ratic form, this form will have a positive value, as we see from the 
reduced form (1). 

Similarly, by choosing for the z’s a real solution of the equations 


Ba 0,-..a; = 0; cere rh Peewos= 18 
which does not consist exclusively of zeros, we see that the quad- 
ratic form takes on a negative value. 
We pass now to some theorems which will be better appreciated 


by the reader if he considers their geometrical meaning in the 
case n= 4. 


THEOREM 2. Jf an indefinite quadratic form is positive at the real 
point (yy, °* Yn) and negative at the real point (21, +++ @), then there 
are two real points linearly dependent on these two, but linearly inde- 
pendent of each other, at which the quadratic form is zero, and neither 
of which is a vertex of the form. 


The condition that the quadratic form 
(4) LUyvim; 
vanish at the point (y, + Az, +++ Yn + AZ,) 18 
Zayyiys + 2 rAZayye; + MRayee; = 0. 


This quadratic equation in » has two real distinct roots, since, 
from our hypothesis that (4) is positive at y and negative at z, it 
follows that 


n 2 n n 
(2ayyz,) - (Sasy) (Sa,2;) 70. 
Let us call these roots A, and A,. Then the points 


(5) (yy + Ayo Yn + AVZn)s (Yz + eer ot Yn + een) 
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are two real points linearly dependent on the points y and z at which 
(4) vanishes. 
Next notice that 
Yi tr Mei Yj + My; 
Yat Moe Yj 1 Age; 


Yi Y; : 


i 2; 


(8) 


a NS 














Since the points y and z are linearly independent, the integers %, 7 
can be so chosen that the last determinant on the right of (6) is not 
zero. Then the determinant on the left of (6) is not zero; and, 
consequently, the points (5) are linearly independent. 

In order, finally, to prove that neither of the points (5) is a 
vertex, denote them for brevity by 


(Yu Yn (4p ++ Sn). 


Letting X, — A, = 1/H, we have 


% = BY; — pZ, (@ = 1, 2, --- ). 
Therefore 
(7) Bayee; = WLay VY, — 2 pPRayVZ, + WP Lay ZZ, 


Since the points Y and Z have been so determined that (4) vanishes 
at them, the first and last terms on the right of (7) are zero. If 
either Y or Z were a vertex, the middle term would also be zero ; 
but this is impossible since the left-hand member of (7) is, by 
hypothesis, negative. Thus our theorem is proved. 

For the sake of completeness we add the corollary, whose truth 
is at once evident : 


CoROLLARY. The only points linearly dependent on y and z at 
which the quadratic form vanishes are points linearly dependent on one 
or the other of the points referred to in the theorem; and none of these 
are vertices. 


We come now to a theorem of fundamental importance in the 
theory of quadratic forms. 


THEOREM 3. A necessary and sufficient condition that a real 
quadratic form be definite is that it vanish at no real points except its 
vertices and the point (0, 0, --- 0). 
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Suppose, first, that we have a real quadratic form which vanishes 
at no real points except its vertices and the point (0, 0, --- 0). If it 
were indefinite, we could (Theorem 1) find two real points y, 2, at 
one of which it is positive, at the other, negative. Hence (Theorem 
2) we could find two real points linearly dependent on y and g, at 
which the quadratic form vanishes. Neither of these will be the 
point (0, 0, --- 0), since, by Theorem 2, they are linearly independ- 
ent. Moreover, they are neither of them vertices. Thus we see 
that the form must be definite, and the sufficiency of the condition 
is established. 

It remains to be proved that a definite form can vanish only at 
its vertices and at the point (0, 0, --- 0). 

Suppose (4) is definite and that (y,, --- y,) is any real point at 
which it vanishes. Then, 


Laila; + Ayi)(a; + AY;) = LAU; Jk, & ADA ili. 


If y were neither a vertex nor the point (0, 0, --- 0), Za,a,y; would 
not vanish identically, and we could find a real point (¢,, --- 2,) such 


that n 
a Lape; iP 


If we let c= Saye 
we have 
(8) Lae + yi)(Z; + yj) = ¢ + 2k. 


For a given real value of A, the left-hand side of this equation 

is simply the value of the quadratic form (4) at a certain real point. 

. Accordingly, for different values of 2 it will not change sign, while 

the right-hand side of (8) has opposite signs for large positive 

and large negative values of X. Thus the assumption that y was 

neither a vertex nor the point (0, 0, --- 0) has led to a contradiction; 
and our theorem is proved. 


CoROLLARY. A non-singular definite quadratic form vanishes, 
for real values of the variables, only when its variables are all zero. 


Asa simple application of the last corollary we will prove 


THEorEM 4. In a non-singular definite form, none of the coeffi- 
cients of the square terms can be zero. 
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For suppose the form (4) were definite and non-singular; and 
that a, = 0. Then the form would vanish at the point 


Ce a a 0, a=1; 


and this is impossible, since this is not the point (0, 0, --- 0). 


EXERCISES 
1. DeriniTion. By an orthogonal transformation * is understood a linear trans- 
formation which carries over the variables (21)--- Zp) into the variables (x{,---%,) in such 
a way that 


H2 + a2 + oe + etal? + OB 4 oe + 2, 
Prove that every orthogonal transformation is non-singular, and, in particular, 
that its determinant must have the value + 1 or — 1. 
2. Prove that all orthogonal transformations in n variables form a group; and 
that the same is true of all orthogonal transformations in n variables of deter- 
minant +1. 


3. Prove thata necessary and sufficient condition that a linear transforma- 
tion be orthogonal is that it leave the “ distance” 


between every pair of points (y1, ++: Yn), (21) +++ Zn) Invariant. 

4. Prove that if n = 38, and if 21, 2, 23 be interpreted as non-homogeneous 
rectanglar coérdinates in space, an orthogonal transformation represents either a 
rigid displacement which leaves the origin fixed, or such a displacement combined 
with reflection in a plane through the origin. 

Show that the first of these cases will occur when the determinant of the 
transformation is + 1, the second when this determinant is — 1. 


5. If the coefficients of a linear transformation are denoted in the usual way 
by cy, prove that a necessary and sufficient condition that the transformation be 


orthogonal is that Cuber +e. te= 1 (G@=1,2,--n), 
5 12) 
C14 C1y + CaxCa; + +++ + CniCny = O | 3 Voz]. 
gH 1,2, 2m 


Show that these will still be necessary and sufficient conditions for an orthogonal 
transformation if the two subscripts of every c be interchanged.t 


* The matrix of such a transformation is called an orthogonal matrix, and its deter- 
minant an orthogonal determinant. 

+ We have here 4 n (n + 1) relations between the n? coefficients of the transforma- 
tion. This suggests that it should be possible to express all the coefficients in terms of 
n2 — (n+) — n(n—1) 

2 Pe ee 
of them, or if we prefer in terms of }n(m—1) other parameters. For Cayley’s dis- 
cussion of this question cf. Pascal’s book, Die Determinanten, § 47. Cayley’s formule, 
however, do not include all orthogonal transformations except as limiting cases. 


CHAPTER XII 


THE SYSTEM OF A QUADRATIC FORM AND ONE OR MORE 
LINEAR FORMS 


53. Relations of Planes and Lines to a Quadric Surface. If the 
plane 


(1) UL + Une + Ugh, + Ut, = 0 
is a true tangent plane to the quadric surface 

4 
(2) ZA yHie; = 0, 


there will be a point (y1, Ya, Ys, y,) (namely the point of contact) 
lying in (1) and such that its polar plane 


4 


coincides with (1). From elementary analytic geometry we know 
that a necessary and sufficient condition that two equations of the 
first degree represent the same plane is that their coefficients be pro- 
portional. Accordingly, from the coincidence of (1) and (8), we 
deduce the equations 


Ay 1Y 4 + Ay2Yq + %3Yg + 444 — Py = 9; 
(4) Boi Y + Aang + Magy + U4 — PMg = 9, 
Mg1Y1 + AgeYy + Aga + 3444 — Pg = 0, 
Ag1Y1 + Ayo + Vygg + M444 — Py = 9. 
From the fact that the point y lies on (1), we infer the further 
relation 
(9) UY, + Usa + Ugly + Ug y = 9. 

These equations (4) and (5) have been deduced on the suppo- 
sition that (1) is a true tangent plane to (2). They still hold if 
it is a pseudo-tangent plane; for then the quadric must be a cone, 
and a vertex of this cone must lie on(1). Taking the point y as 


this-vertex, equation (5) is fulfilled. Moreover, since now the first 
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member of (3) is identically zero, equations (4) will also be fulfilled 
if we let p=0. Thus we have shown in all cases, that if (1) is a 
tangent plane to (2), there exist five constants, y,, Yq Y3, Yu P» of 
which the first four are not all zero, and which satisfy equations (4) 


and (5). Hence 
(9) A, Ug Wg Aq WY 


Moy 2 %g AUp4 Up 

(6) Ms, Wz gg gy Ug | = 0. 
By, yn, Ag Ogg Uy 
Uy, Uy, Ug wm OY 


Conversely, if this last equation is fulfilled, there exist five 
constants, ¥4, Yo: Ys: Ya P» not all zero, and which satisfy equations 
(4) and (5). We can go a step farther and say that y;, Yo, Y3) V4 
cannot all be zero, as otherwise, from equations (4) and the fact that 
the w’s are not all zero, p would also be zero. ‘Thus we see that if 
equation (6) is fulfilled, there exists a point (y,, Y, Ys, Y4) in the 
plane (1) whose codrdinates, together with a certain constant p, 
satisfy (4). Ifp=0, this shows that the quadric is a cone with y 
as a vertex, and hence that (1) is at least a pseudo-tangent plane. 
If p # 0, equations (4) show us that the polar plane (8) of y coin- 
cides with the plane (1). Moreover we see,:either geometrically, or 
by multiplying equations (4) by y, Ya, Ys Y4 respectively and add- 
ing, that the point y lies on the quadric; so that, in this case, (1) is 
a true tangent plane. 

We have thus established the theorem : 


THEOREM 1. Equation (6) is a necessary and sufficient condition 
that the plane (1) be tangent to the quadric (2). 


It will be seen that this theorem gives us no means of distinguish- 
ing between true and pseudo-tangent planes of quadric cones. In 
the case of non-singular quadrics, pseudo-tangent planes are impos- 
sible, and therefore equation (6) may, in this case, be regarded as the 
equation of the quadric in plane-codrdinates. 

In the case of a quadric surface of rank 3, that is, of a cone with 
a single vertex, the codrdinates (2,, U, Us, U4) of every plane through 
this vertex satisfy equation (6), so that in this case this equation 
represents a single point, and not the quadric cone.* 


* In fact a cone cannot be represented by a single equation in plane-codrdinates. 
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If the rank of (2) is less than 3, the codrdinates of every plane 

in space should satisfy (6), since every such plane passes through a 

vertex and is therefore a tangent plane. This fact may be verified 
by noticing that equation (6) may also be written 


4 
DA yey = 0, 


where the A’s are the cofactors in the discriminant of (2) according 
to our usual notation. 

We pass now to the condition that a straight line touch the 
quadric (2). This line we will determine as the intersection of the 
two planes (1) and 
(7) V4L1 + V—%y + Ug%y + VX, = 0. : 

If the line of intersection of these planes is a true tangent to (2), 
there will be a point (y,, Y>, Ys ¥4)) namely the point of contact, lying 
upon it, and such that its polar plane (3) contains the line. It must there- 
fore be possible to write the equation of this polar plane in the form 


(8) S (uu + VY;) a= 0; 
and, in fact, by properly choosing the constants mw and », the co- 
efficients of (8) may be made not merely proportional, but equal to 
the coefficients of (3): 
By Y1 + AyeYo + Ay3Yg + M44 — Hy — 00, =9; 
Mas Y + AggYq + Angi + n4Y4 — HUly — VV4= 0, 
Ag1Y1 + Agen + MgaYg + Ag4 4— Hg — 1g = 0, 
gy + Appa + Ugg + Vy 4 — Ug — 14 = 9. 

Since the point y lies on the line of intersection of the planes (1) 
and (7), we also have the relations 
(10) SS Ds ata 

VY + aYo + Yat Yass = 9. 

Since the six equations (9) and (10) are satisfied by six constants 

Ys Yo Ya Ya My v not all zero, we infer finally the relation 


(9) 


ie ECTS Sapo aie Ges lem! 
Tage eax Case gd, Uy Vg 

(11) Msi ax S98, 94 Us M8 | 9, 
Wa Va) ag Mig My Ue 
Wi eatin wien tig O° 0 
Viton eee, 0 0 
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We have deduced this equation on the supposition that the line of 
intersection of (1) and (7) is a true tangent to (2). We leave it to 
the reader to show that (11) holds if this line is a pseudo-tangent, 
and also if it is a ruling of (2). 

We also leave it for him to show that if (11) holds, the line of 
intersection of (1) and (7) will be either a true tangent, a pseudo- 
tangent, or a ruling, and thus to establish the theorem: 


THEOREM 2. A necessary and sufficient condition that the line of 
intersection of the planes (1) and (7) be either a tangent or a ruling 
of (2) is that equation (11) be fulfilled. 

On expanding the determinant in (11), it will be seen that it is 
a quadratic form in the six line-codrdinates q, (cf. Exercise 3, § 35). 
Equation (11) may therefore be regarded as the equation of the 
quadric surface in line-codrdinates if the surface is not a cone, or is 
a cone with a single vertex. If the rank of (2) is 2, so that the 
quadric consists of two planes, (11) is the equation of the line of 
intersection of these planes. While if the rank is 1 or 0, (11) is 
identically fulfilled. 

EXERCISES 


1. Two planes are said to be conjugate with regard to a non-singular quadric 
surface if each passes through the pole of the other. 

Prove that if (2) is a non-singular quadric, a necessary and sufficient con- 
dition that the planes (1) and (7) be conjugate with regard to it is the vanishing 


of the determinant 
Q, G2 Ag Ay WY 


Mp 499 Ag Any Up 4 

bs “Wap ish (hye Ae | eo 2A Uj Uj. 
Gy Uy Ugg Uy Uy 

OF ty Gh. Oy © 


How must this definition of conjugate planes be extended in order that this 
theorem be true for singular quadrics also? 


2. Prove that if (2) is a non-singular quadric, a necessary and sufficient con- 
dition that the point of intersection of three planes lie on (2) is the vanishing of 
the seven-rowed determinant formed by bordering the discriminant of (2) with the 
coefficients of the three planes. 


3. Admitting it to be obvious geometrically that a necessary and sufficient con- 
dition that a line touch a non-singular quadric is that the two tangent planes which 
can be passed through this line should coincide, prove that, if (2) is non-singular, 
a necessary and sufficient condition that the line of intersection of (1) and (7) 


touch (2) is 4 4 4 
a c: A quits) e- A yvivj) — CG A yuiv;)? = 0. 


4. Show algebraically that the condition of Exercise 3 is equivalent to (11), 
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54. The Adjoint Quadratic Form and Other Invariants. Passing 
now to the case of n variables, we begin by considering the system 
consisting of a quadratic form and a single linear form 


(1) Ey; Xijp 
(2) Sus. 


The geometrical considerations of the last section suggest that we 
form the expression ey ae 
(3) 2A; U; U; a . ‘ 5 5 
Ani °° Ann Un 
tm = uw, 0 

This, it will be seen, is a quadratic form in the variables (a, --- u,) 
whose matrix is the adjoint of the matrix of (1). We will speak of 
(3) as the adjoint of (1). 

The invariance of (3) is at once suggested by the fact that in the 
case n = 4 the vanishing of (3) gave a necessary and sufficient con- 
dition for a projective relation. In fact we will prove the theorem : 


THEOREM 1. The adjoint form (8) is an invariant of weight two of 
the pair of forms (1), (2). 

Inasmuch as the w’s are, as we saw in § 34, contragredient to the 
z’s, we may also call (3) a contravariant (cf. Definition 2, § 34). 

In order to prove this theorem we must subject the a’s to a linear 
transformation, 


(4) 


! 
Ty = Cy V+ + Cnty 


a i J 
Ly, = Cn VY ee + Cnn Xny 


whose determinant we will call ce. Let us denote by aj, and wu} re- 
spectively the coefficients of the quadratic and linear form into which 
this transformation carries (1) and (2). 

Let us now introduce an auxiliary variable ¢, and consider the 
quadratic form in %, -+- %,, ¢, 


(5) 2ay Dy Ly +2 (Uy + + + Uy Dp) 


The discriminant of this form is precisely the determinant in 
(3), that is, the negative of the adjoint of (1). 
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Let us now perform on the variables 2,---,,¢ the linear trans- 
formation given by formule (4) and the additional formula 


(6) t=. 
The determinant of this transformation is ce, and it carries over the 


form (5) into 3 alate! +2 t(ulat + + + ulx}). 


From the fact that the discriminant of (5) is an invariant of 
weight 2, we infer the relation we wished to obtain : 


Ayo Uy Uy Byy ** Ayn Uy 
ae hi Sy! 
Any *** Un Un Any *** Irn Un 


The method just used admits of immediate extension to the proof 
of the following more general theorem : 


THEOREM 2. If we have a system consisting of a quadratic form in 
n variables and p linear forms, the (n+ p)-rowed determinant formed by 
bordering the discriminant of the quadratic form by p rows and p 
columns each of which consists of the coefficients of one of the linear 
forms is an invariant of weight 2. 

We leave the details of the proof of this theorem to the reader. 

If the discriminant a of the quadratic form (1) is not zero, we may 
form a new quadratic form whose matrix is the inverse of the matrix 
of (1). This quadratic form, which is known as the ¢énverse or 
reciprocal of (1), is simply the adjoint of (1) divided by the discrimi- 
nant a. We will prove the following theorem concerning it: 

THEOREM 3. Jf the quadratic form (1) ts non-singular, it will be 
carried over into its inverse by the non-singular transformation 
(7) Uh = My Ly + ++ + Ain Ly (¢= 1, 2, ---n). 

For we have . ee ny 


But from (7) we have 
ty Bab +e 
a 
and therefore n n A 


as was to be proved. 
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It will be noticed that if (1) is a real quadratic form, the trans- 
formation (7) is real; and from this follows 


THEOREM 4. A real non-singular quadratic form and its inverse 
have the same signature. 


EXERCISES 


1. Given a quadratic form Saj2x;2; and two linear forms Sugx;, 30; 24. 
Prove that 


ergy 20) hi. SER) 
n 
= A Buju; = — 
Any + Ann Un 
vy os Un 0 


is an invariant of the system of weight 2. 


2. Generalize the theorem of Exercise 1 to the case in which we have more 
than two linear forms. 


3. Prove that if a first quadratic form is transformed into a second by the 
linear transformation of matrix c, then the adjoint of the first will be transformed 
into the adjoint of the second by the linear transformation whose matrix is the 
conjugate of the adjoint of c. 


4. Prove a similar theorem for bilinear On ms. 


5. State and prove a theorem for bilinear forms analogous to Theorem 3. 


55. The Rank of the Adjoint Form. Suppose the discriminant a 


of the quadratic form ZO gi; is of rank r, and that the discrimi- 


nant A of its adjoint 3 A,uyu,; is of rank R. Then, if r<n—1, all 


the (n — 1)-rowed Aorerutnants of a are zero; but these are the ele- 
ments of A, hence R=0. If r=n—1, at least one of the elements 
of A is not zero, and all two-rowed determinants of A are zero (since 
by § 11 each of them contains a as a factor), hence R=1. Ifr=n, 
R=n; for if R were less than n we should have A= 0, and there- 
fore a=0 (since A=a"1). But this is impossible, since by hypothesis 
r=n. We have then: 


THEOREM 1. Jf the rank of a quadratic form in n variables and of 
its adjoint are r and R respectively, then 
i 7 =, R=n, 
GFr=n—1, K=1, 
gr<n—1, R=0. 
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Let us consider further the case r=n—1. Here we have seen 
that R =1, that is, that the adjoint is the square of a linear form, 


n n n 
Zi A yuu; = CD = > CCW Uj. 


Comparing coefficients, we see that 


A 


i = CiCje 
All the ¢’s cannot be zero, as otherwise we should have R=0. Let 
¢,.+#0. Then since Ava 2720 

os ork 


we see that not all the quantities (A,,, --- A,,) are zero. Accord- 
ingly (cf. §44) the point (A,y, Aj, --- A,,), and therefore also the 
point (¢,, --: ¢,), is a vertex of the original quadratic form. Thus we 
have the theorem : 


THEOREM 2. If the rank of a quadratic form in n variables isn — 1, 
its adjoint is the square of a linear form, and the coefficients of this 
linear form are the codrdinates of a vertex of the original form. 

Since, in the case we are considering, all the vertices of the 
quadratic form are linearly dependent on any one, this theorem com- 
pletely determines the linear form in question except for a constant 
factor. 


CHAPTER XIIT 
PAIRS OF QUADRATIC FORMS 


56. Pairs of Conics. We will give in this section a short geomet- 
rical introduction to the study of pairs of quadratic forms, confining 
ourselves, for the sake of brevity, to two dimensions. 

Let uw and v be two conics which we will assume to be so situated 
that they intersect in four, and only four, distinct points, A, B, C, D. 
Consider all conics through these four points. These conics, we will 
















2 
LKR 


say, form a pencil. It is obvious that there are three and only three 
singular conics (7.e. conics which consist of pairs of lines) in this 
pencil, namely, the three pairs of lines AB, CD; BC, DA; AO, BD. 
Let us call the “vertices” of these conics P, Q, and # respectively. 

From the harmonic properties of the complete quadrilateral* we 
see that the secants PAB and PCD are divided harmonically by the 


* Cf. any book on modern geometry. 
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line QR. Accordingly QF is the polar of P with regard to every 
conic of the pencil. Ina similar manner PA is the polar of Q, and 
PQ the polar of R with regard to every conic of the pencil. Thus, 
we see that the triangle PQR is a self-conjugate triangle (see § 41) 
with regard to every conic of the pencil. Accordingly, if we per- 
form a collineation which carries over P, Q, R into the origin and 
the points at infinity on the axes of @ and y, the equation of every 
conic of the pencil will be reduced to a form in which only the 
square terms enter. We are thus led to the result: 


THEOREM. Jf two conics intersect in four and only four distinct 
points, there exists a non-singular collineation which reduces their 
equations to the normal form 


Ax? + Az? + Av? = 0, 
Bx? + Box} + Bx} = 0. 


If we wish to carry through this reduction analytically, we shall 
write the equations of the two conics w and v in the forms 


(1) Saye; = 0, Sb ye, => 0. 


The pencil of conics may then be written 
3 
(2) i (uy — Abia yw; = 0, 


or rather, to be accurate, this equation will represent for different 
values of X all the conics of the pencil except the conic v. The 
singular conics of the pencil will be obtained by equating the 
discriminant of (2) to zero, 


Ay, — Dy, yy — Nyy yz — N43 
(3) Ay — Abo, Aggy — Nog og — bys |= 0. 
a1 — Adz, Aza — bgp A33— Age 


This equation we will call the 2- -equation of the two conics. 
When expanded, it takes the form 


(4) —AN +O? Or +A=0, 
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where A, A’ are the discriminants of w and v respectively, and 


LQ, Aq Ogg Ay, Oy Ag hae M19 43 
=]q1 Ag, O93) + |G, oq gg | + 


As, Azg  Ogs 3, Og5 agg 


bo, gg Ag ’ 





ai. en" ge 

while ©’ can be obtained from © by an interchange of the letters a 
and 6. It can readily be proved (cf. the next section) that the co- 
efficients ® and ©’ as well as Aand A’ are invariants of weight two. 

Except when the discriminant A’ of v is zero, the equation (4) 
is of the third degree, and its three roots, which in the case we 
have considered must evidently be distinct, give, when substituted 
in (2), the three singular conics of the pencil. 

We will not stop here to show how the theory of any two 
conics, where no festriction as to the number of points of intersec- 
tion is made, can be deduced from equation (3).* This will follow 
in Chapter XXII as an application of the method of elementary 
divisors. Our only object in this section has been to give a geo- 
metrical basis for the appreciation of the following sections. 


57. Invariants of a Pair of Quadratic Forms. Their \-Equation. 
We consider the pair of quadratic forms 


n 
P( 2) °+* Lp) = ZU yjUiLjp 


Vp hy) = Eb eat 
and form from them the pencil of quadratic forms 
o—Ap= (ay — big wi. 
The discriminant of this pencil, 
yy — yy 2° Ayn — My 
eee =F), 
re 7 oe “+ Ann — Onn 
is a polynomial in X which is in general of degree n, and which may 
be written FQ)=8,-—O,r+ + +(—170,.". 


* An elementary discussion of the \-equation of two conics (l’équation end) is 
regularly given in French text-books on analytic geometry. See, for instance, Briot 
et Bouquet, Lecons de Géométrie analytique, 14th ed., p. 849, or Niewenglowski, Cowrs 
de Géométrie analytique, Vol. I, p. 459. 
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The coefficients of this polynomial are themselves polynomials in 
the a,’s and 6,’s, ®, and ®, being merely the discriminants of ¢ and 
a respectively, while @, is the sum of all the different determinants 
which can be formed by replacing & columns of the discriminant of 
¢ by the corresponding columns of the discriminant of . 


THEOREM 1. The coefficients @,, --- ©, of F(A) are integral rational 
invariants of weight two of the pair of quadratic forms ¢, .* 


In order to prove this, let us consider a linear transformation of 
determinant ¢ which carries over ¢ and > into ¢/ and W’ respec- 


tively, where yes : 
p= => Uy wi eh, 


p! = 24, al. 


Let us denote by ©! the polynomial in the aj,’s and b/s obtained by 
putting accents to the a’s and 0’s in @,. Our eoren will then be 
proved if we can establish the identities 


0! = 20, 7=0,1,---n 
a a ( 9 ) 


This follows at once from the fact that F(A), being the discriminant 
of @ — Ay, is an invariant of weight two, so that if we denote by 
F(A) the discriminant of ¢' — Aw’, we have . 


F'(’) = 2 F(A). 


This being an identity in as well as in the a’s and 0's, we can 
equate the coefficients of like powers of » on the two sides, and this 
gives precisely the identities we wished to establish. + 

The equation F(a) =0 


we will call the \-equation of the pair of forms ¢,y. Since, as we 
have seen, #’ is merely multiplied by a constant different from zero 
when ¢ and w are subjected to a non-singular linear transformation, 


* Cf. Exercise 13, § 90. 

+ The method by which we have here arrived at invariants of the system of two 
quadratic forms will be seen to be of very general application. If we have an integral 
rational invariant 7 of weight u of a single form of the kth degree in n variables, we can 
find a large number of invariants of the system 94, $2, +++, of p forms of the kth degree 
in n variables by forming the invariant J for the form 141 + +++ +Ap,dp. This will be 
a polymonial in the )’s, each of whose coefficients is seen, precisely as above, to be an 
integral rational invariant of the systems of ¢’s of weight yu. 
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the roots of the A-equation will not be changed by such a transfor- 
mation. These roots, hosever, are irrational functions of the @’s and 
hence of the a’s and 6’s. We may therefore state the result : 


THEOREM 2. The roots of the d-equation of a pair of quadratic 
Forms are absolute irrational invariants of this pair of forms with 
regard to non-singular linear transformations. 

It is clear that the multiplicity of any root of. the X-equation 
will not be changed by a non-singular linear transformation. Hence 


THEOREM 3. The multiplicities of the roots of the d-equation are 
arithmetical invariants of the pair of quadratic forms with regard to 
non-singular linear transformations. 


If b = ar + + 4,22, 
ps ait te + 23, 
the roots of the A-equation are a,,-:-a,. This example shows 
that the absolute invariants of Theorem 2 may have any values, 
and also that the arithmetical invariants of Theorem 8 are subject 


to no other restriction than the obvious one of being positive in- 
tegers whose sum is n. 


58. Reduction to Normal Form when the \-Equation has no Multi- 
ple Roots. Although our main concern in this section is with the 
case in which the A-equation has no multiple roots, we begin by estab- 
lishing a theorem which applies to a much more general case. 


THEOREM 1. Jf 2, is a simple root of the r-equation of the pair 
db, wv of quadratic forms in n variables, then, by a non-singular linear 
transformation, p and can be reduced respectively to the forms 


(1). hae a Py (2 n+ By 
OI ad Wy (Za) aa Zn) 


where c, is a constant not zero and d4, W, are quadratic forms in the 
n—1 variables 2, +++ 2y- 


To prove this, we will consider the pencil of forms 


(2) p—Aps=h—AyW + Ay — A). 
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Since d, isa root of the A-equation of the pair of forms ¢, yp, the form 
—,\p is singular, and can therefore, by a suitable non-singular linear 
transformation, be written in a form in which one of the variables, say 


vi, does not enter, b—up=?¢' (ah, --- wh). 


If this transformation reduces yp to w’, we have 
(3) P— Aap = Pi (yy ++ Bh) + Aq — A)W' (Cp +> Mn). 

The discriminant of the quadratic form which stands here on the 
right cannot contain A, —A as a factor more than once, since A, is, by 
hypothesis, not a multiple root of the A-equation of dand y. It 
follows from this that the coefficient of x? in the quadratic form w’ 
cannot be zero, for otherwise the discriminant of the right-hand side 
of (3) would have a zero in the upper left-hand corner, and A,—” 
would be a factor of all the elements of its first row and also of its 
first column ; so that it would contain the factor (A, — dA). 

Since the coefficient of x2 in W’ is not zero, we can by Lagrange’s 
reduction (Formule (2), (3), §45) obtain a non-singular linear trans- 
formation of the form 

2 = T+ Yat + + nth 
fy = Lp 


a= a 
which reduces w’ to the form 
C125 + Wy (2a) -+* 2p) (¢ #0). 
This transformation carries over the second member of (3) into 
! (Zap + Za) + (Ay — 2) Wy (Zap +++ Zn) + AY — AY EyAP. 
Combining these two linear transformations and writing 
P! (259 *** Zn) + AY Wy (Zap o> 2a) = Py(Zo9 ** En)s 


we have thus obtained a non-singular linear transformation which 
effects the reduction, 
P (yy °° Yq) — p(y ++ iy) = hy (Zap 2 @n)— aby (ay +++ Zn) + (Ay) C23 


if, here, we equate the coefficients of X% on both sides, and the 
terms independent of X, we see that we have precisely the reduc- 
tion of the forms ¢, y to the forms(1); and the theorem is proved. 


PAIRS OF QUADRATIC FORMS 169 


Let us now assume that the form > is non-singular, thus insur- 
ing that the A-equation be of degree n. We will further assume 
that the roots d,, A,, “+ A, of this equation are all distinct. We can 
then, by the theorem just proved, reduce the forms ¢, y to the forms 
(1) by a non-singular linear transformation. The d-equation of these 
two forms is seen to differ from the A-equation of the pair of forms 
in (n—1) variables ¢,, yr, only by the presence of the extra factor 
-,—»A. Accordingly the A-equation of the pair of forms ¢,, yr, has 
as its roots A,, --- AX, and these are all simple roots. We may there- 
fore apply the reduction of Theorem 1 to the two forms @¢,, w, and 
thus by a non-singular linear transformation of 2,, --- 2, reduce them 
to the forms Ny ln2ee + ho (2h, “* Zh)s 

Cy2ee + Wo(25 ++ Zn) 
This linear transformation may, by means of the additional formula 
Zi 2), 
be regarded as anon-singular linear transformation of 2,, --- 2, which 
carries over ¢, Y into the forms 
Dy 2 + Agegeg + hy (2h --* 2n)s 
CAP + Cy 252 +-o( eb ++ Sn)» 
Proceeding in this way, we establish the theorem: 

THeorEeM 2. If 6, are quadratic forms in (a1, +++ £,) of which the 
second is non-singular, and if the roots d4, +++», of their A-equation are all 
distinct, there exists a non-singular linear transformation which carries 
over p and rp into Ny OB Hg Cg te Hos Hn On Ue 

oe a Gap | c, a2 
respectively, where ¢4,-+- ¢, are constants all different from zero. 


Since none of the c’s are zero, the linear transformation 
al! = Ve, wh G=1, 2.2.2") 
is non-singular. Performing this transformation, we get the further 
result: 


TurorEM 3. Under the same conditions as in Theorem 2, 6 and 
wp may be reduced by means of a non-singular linear transformation to 


the normal forms yA + gah te +g 22, 


mt ate + Oh 
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From this we infer at once 


THEOREM 4. Jf in the two pairs of quadratic forms , and ¢’, 
wy’ the forms and w' are both non-singular, and if the r-equations of 
these two pairs of forms have no multiple roots, a necessary and suf- 
ficient condition for the equivalence of the two pairs of forms is that 
these two A-equations have the same roots ; or, what amounts to the same 
thing, that the invariants @, @,, --- ©, of the first pair of forms be pro- 
portional to the invariants ©), ©}, --- Of, of the second. 


EXERCISE 


Prove that, under the conditions of Theorem 3, the reduction to the normal 
form can be performed in essentially only one way, the only possible variation 
consisting in a change of sign of some of the 2’s in the normal form. 


59. Reduction to Normal Form when wW is Definite and Non- 
singular. We now consider the case of two real quadratic forms 
¢, yw of which wy is definite and non-singular. Our main problem is 
to reduce this. pair of forms to a normal form by means of a real 
linear transformation. For this purpose we begin by proving 


THEOREM 1. The d-equation of a pair of real quadratic forms 
g, wv can have no imaginary root if the form is definite and non- 
singular. 

For, if possible, let « + 62 (« and 8 real) be an imaginary root of 
this X\-equation, sothat 8 #0. Then 6—aew —iGwy will bea singular 
quadratic form, and can therefore be reduced by a non-singular 
linear transformation 


UL = (Pay + 2941) p+ + Din t+ Gin) Las 
n= (Pat + 19 ni) 2+ ++ + (Pan + 4Gnn) Sn 


to the sum of & squares, where k < n, 


(1) p—ap—iByp=rr+oR +. +22, 
Let 

(2) Yi= PyX% + +++ +Pm%ns 

(3) = WX + + +9m2n5 


so that Uy = Yi + ty. 
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By equating the coefficients of 7 on the two sides of (1) we thus get 
(4) — Bps2 yey + 2Yy eto +2, %- 


Let us now determine ,, --- z, so as to make the right-hand side 
of (4) vanish, for instance by means of the equations 


N=Y= 0 = Y=%. 

A reference to (2) shows that we have here a system of & real 
homogeneous linear equations in n unknowns, so that real values of 
24, +*- 2, not all zero can be found satisfying these equations. For 
these values of the variables, we see from (4) that w vanishes; but 
this is impossible (cf. the Corollary of Theorem 8, § 52), since w is by 
hypothesis non-singular and definite. 


THEOREM 2. Jf w is a non-singular definite quadratie form and > 
ts any real quadratic form, the pair of forms >, ~ can be reduced by a 
real non-singular linear transformation to the normal form 


(5) ie + Aya? + a +A, U7), 


apse’ ( ae aus at ap), 
where 4, -++* Xn are the roots of the r-equation, and the upper or lower 


sign is to be used in both cases according as is a positive or a negative 
form. 


The ect of this theorem is very similar to the proof of 
Theorem 2, § 58. We must first prove, as in Theorem 1, §58, 
that ¢, yw can be reduced by a real non-singular linear transforma- 


tion to the forms 
pce (2q5 +++ 2 Zn) (c, #0) 
6 
a cst + Wa (tay tn): 


To prove this, we consider the pencil of forms 
$—APSo-—Aypty—r)¥- 


Since d, is real by Theorem 1, 6—A,> is a real singular quad- 
ratic form, and can therefore by a real non-singular linear trans- 
formation be reduced to a form in which one of the variables does 


not enter 
f — Ay = $' (x +++ Lp). 
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If this transformation reduces w to wz’, we have 
(7) P— App = $! (ayy +++ Gy) + (Ay — A) WP" (Why ++ Tn): 


At this point comes the essential difference between the case 
we are now considering and the case considered in § 58, as A, may 
now be a multiple root of the discriminant of the right-hand side 
of (7). We need, then, a different method for showing that the 
coefficient of v2 in y’ is not zero. For this purpose it is sufficient 
to notice that y, and therefore also W, is a non-singular definite 
form, and that accordingly, by Theorem 4, § 52, the coefficient of 
none of the square terms in w’ can be zero. 

Having thus shown that the coefficient of 22 in w’is not zero, 
we can apply Lagrange’s reduction to w’, and thus complete the 
reduction of the forms ¢, wy to the forms (6) precisely as in the proof 
of Theorem 1, § 58, noticing that the transformation we have to deal 
with is real. 

In (6), $y, Wy, are real quadratic forms in the n—1 variables 
Zo) +++ qe Moreover, since 


Wr (Byy 01+ By) = Cy 24 + Wy (gy ++ Sn) 


is non-singular and definite, it follows that the same is true of wy. 
For, if , were either singular or indefinite, we could find values 
of 2, -+-2, not all zero and such that y,=0; and these values to- 
gether with the value z;=0 would make ~y=0. This, however, is 
impossible by the Corollary of Theorem 3, § 52. 

The A-equation of the two forms ¢,, yw, evidently differs from 
the A-equation of ¢, w only by the absence of the factorA—A,. The 
roots of the A-equation of ¢,, W, are therefore d,, ---r,, so that if we 
reduce , and yy, by the method already used for ¢, w (we have just 
seen that ¢,, Wy, satisfy all the conditions imposed on ¢, ), we get 


Dy (2gy +++ 2) Sy Cy 20” + hy (Zhs ++ 2n)s 
Wil2ay +++ Zn) 0320? + Wro(25y --- 2) 


Proceeding in this way, we finally reduce ¢, by a real non-singu- 
lar linear transformation to the forms 
(8) 1 

VS GU ye 
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Since w is definite, the constants Cy, +++ ¢, are all positive or all nega- 
tive according as y is a positive or a negative form. By means of 
the further non-singular real linear transformation 


the forms (8) may be reduced to the forms (5), and our theorem is 
proved. 


EXERCISES 


1. If ¢ isa real quadratic form in n variables of rank r, prove that it can be 
reduced by a real orthogonal transformation in n variables to the form 
C0) + 6,034 oes + Cp Eps 
Cf. Exercises, § 52. 


2. Show that the determinant of the orthogonal transformation of Exercise 1 
may be taken at pleasure as + 1 or —1. 


3. Discuss the metrical classification of real quadric surfaces along the 
following lines: 

Assume the equation in non-homogeneous rectangular coérdinates, and show 
that by a transformation to another system of rectangular coérdinates having the 
same origin the equation can be reduced to a form where the terms of the second 
degree have one or the other of the five forms (the A’s being positive constants) 


2 2 2 
A,%j + Agr + Aga, 


2 2 2 
A4x; + Aguy — A,%3, 


2 2 
Ayxvjt+ Agn3, 
2) 2 
A 21 — Agtp, 
2 
Aj}. 


Then simplify each of the non-homogeneous equations thus obtained by further 
transformations of coérdinates; thus getting finally the standard forms of the 
equations of ellipsoids, hyperboloids, paraboloids, cones, cylinders, and planes 
which are discussed in all elementary text-books of solid analytic geometry. 


‘CHAPTER XIV 
SOME PROPERTIES OF POLYNOMIALS IN GENERAL 


60. Factors and Reducibility. In the present section we will 
introduce certain conceptions of fundamental importance in our 
subsequent work. 


DEFINITION 1. By a factor or divisor of a polynomial f in any 
number of variables is understood a polynomial b which satisfies an 
identity -of the form f= ov, 

a being also a polynomial. 


It will be noticed that every constant different from zero is a 
factor of every polynomial; that every polynomial is a factor of a 
polynomial which vanishes identically; while a polynomial which 
is a mere constant, different from zero, has no factors other than 
constants. 

We note also that a polynomial in a,, --- x, which is not identically 
zero cannot have as a factor a polynomial which actually contains 
any other variables. 

The conception of reducibility, which we have already met in 
a special case (§47), we define as follows: 


DEFINITION 2. A polynomial is said to be reducible if it is iden- 
tically equal to the product of two polynomials neither of which is a 
constant. 


In dealing with real polynomials, a narrower determination of 
the conception of reducibility is usually desirable. We consider, 
then, what we will call reducibility in the domain of reals, a con- 
ception which we define as follows: 


DEFINITION 3. A real polynomial is said to be reducible in the 
domain of reals if it is identically equal to the product of two other 
real polynomials neither of which is a constant. 
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In many branches of algebra still another modification of the 
conception of reducibility plays an important part. In order to 
explain this, we first lay down the following definition: 


DEFINITION 4. A set of numbers is said to form a domain of 
rationality if, when a and 6b are any numbers of the set, a+ 6, 
a—6, ab, and, so far as b#0, a/b are also numbers of the set. 

Thus all numbers, real and imaginary, form a domain of ration- 
ality, and the same is true of all real numbers. The simplest of all 
domains of rationality, apart from the one which contains only the 
single number zero, is what is known as the natural domain, that is all 
rational numbers, positive and negative. A more complicated domain 
of rationality would be the one consisting of all numbers of the form 
a+6~—1, where a and 6 are not merely real, but rational. These 
illustrations, which might be multiplied indefinitely, should suffice to 
make the scope of the above definition clear.* 


DEFINITION 5. A polynomial all of whose coefficients lie in a 
domain of rationality R is said to be reducible in this domain if tt is 
identically equal to a product of two polynomials, neither of which is a 
constant, whose coefficients also le in this domain. 

It will be noticed that Definitions 2 and 3 are merely the special 
cases of this definition in which the domain of rationality is the 
domain of all numbers, and the domain of all reals respectively. To 
illustrate these three definitions, we note that the polynomial z+ 1 
is reducible according to Definition 2, since it is identically equal to 
(a+ V—1) (x— vice 1). It is, however, not reducible in the domain 
of reals, nor in the natural domain. On the other hand, z?— 2 is 
reducible in the domain of reals, but not in the natural domain. 
Finally, 2? — 4 is reducible in the natural domain. 

Leaving these modifications of the conception of reducibility, we 
close this section with the following two definitions: 


DEFINITION 6. Two polynomials are said to be relatively prime if 
they have no common factor other than a constant. 


* By R (a1, 42, ++ dn) is understood the domain of rationality consisting of all 
numbers which can be obtained from the given numbers 4@j, -++ dm by the rational pro- 
cesses (addition, subtraction, multiplication, and division). In this notation the natural 
domain would be most simply denoted by R(1); the domain last mentioned in the 
text by R (1, V—1) or, even more simply, by R (V—1). This notation would not apply 
to all cases (e.g. the real domain) except by the use of an infinite number of arguments. 
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DEFINITION 7. Two methods of factoring a polynomial shall be 
said to be not essentially different if there are the same number n of 
factors in each case, and these factors can be so arranged that the kth 
factors are proportional for all values of k, from 1 to n inclusive. 


EXERCISES 


1. Prove that every polynomial in (2, y) is irreducible if it is of the form 
SO) +4 
where f (x) is a polynomial in z alone. 
Would this also be true for polynomials of the form 


S@)+y*? 
2. If £,¢,w are polynomials in any number of variables which satisfy the 
relation S=OU; 


and if the coefficients of fand ¢@ lie in a certain domain of rationality, prove that 
the coefficients of y will lie in the same domain provided @ ¥ 0. 


Gl. The Irreducibility of the General Determinant and of the Sym- 
metrical Determinant. 


THEOREM 1. The determinant 


Gy -Gyg, Og 
De M1 Aon 1) 
Gut Une Ce 


is an irreducible polynomial if its n? elements are regarded as inde- 
pendent variables. 


For suppose it were reducible, and let 
DEF (My) ++ Ayn) & (Asa +17 Gnn)s 


where neither f nor ¢ is a constant. Expanding D according to the 
elements of the first row, we see that it is of the first degree in a. 
Hence one of the two polynomials f and ¢@ must be of the first 
degree in a,,, the other of the zeroth degree. Precisely the same 
reasoning shows that if a, is any element of D, one of the polyno- 
mials f and ¢ will be of the first degree in a, While the other will 
not involve this variable. 

Let us denote by f that one of the two polynomials which involves 
a, any element of the principal diagonal of D. Then ¢ does not 
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involve any element of the 7th row or the 7th column. For if it did, 
since f is of the first degree in a, and ¢ is of the zeroth, their product 
D would involve terms containing products of the form a,a, or 
4,4; Which, from the definition of a determinant, is impossible. 
Consequently, if either one of the polynomials f and ¢ contains any 
element of the principal diagonal of D, it must contain all the ele- 
ments standing in the same row and all those standing in the same 
column with this one, and none of these can occur in the other 
polynomial. 

Now suppose f contains a,, and that ¢ contains any other element 
of the principal diagonal, say a, Then a;; and a,, can be in neither f 
nor ¢, which is impossible. Hence, if f contains any one of the ele- 
ments in the principal diagonal, it must contain all the others, and 
hence all the elements, and our theorem is proved. 


THEOREM 2. The symmetrical determinant 


Gy °° Un 
7 (a; = aj;) 
Any °°" Ann 


is an irreducible polynomial if its n(n +1) elements be regarded as 
independent variables. 


The proof given for the last theorem holds, almost word for word, 
in this case also, the only difference being that while D is of the first 
degree in each of the elements of its principal diagonal, it is of the 
second degree in each of the other elements. The slight changes in 
the proof made necessary by this difference are left to the reader. 


EXERCISES 
1. The general bordered determinant 
QYi+++ Ain Uless Ulp 
Anglos Ann Uni +++ Unp 


V1 +++ Vin O) rare!) 


Upl +++ Vpn AV iraterey ©) 
is irreducible if p <n, the a’s, w’s, and v’s being regarded as independent variables. 


2. The symmetrical bordered determinant obtained from the determinant in 
Exercise 1 by letting ay = aj, wy = vy is irreducible if p <n. 
N 
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3. If for certain values of i and j, but not for all, a, = a,j, but if the a’s are 
otherwise independent, can we still say that 


Ai. ++ Ayn 


tt : ioc th 
is irreducible ? Oe aL 


4. Prove that a skew-symmetric determinant (cf. Exercises 2, 3, §20) is always 
reducible by showing that, when it is of even order, it is a perfect square. 

[Sueeustion. Use Corollary 3, §11, and Theorem 6 and Exercise 1, § 76.] 

Does this theorem require any modification if the elements are aol and we 
consider reducibility in the domain of reals? 


62. Corresponding Homogeneous and Non-Homogeneous Polyno- 
mials. It is often convenient to consider side by side two polyno- 
mials, one homogeneous and the other non-homogeneous, which 
bear to one another the same relation as the first members of the 
equations of a plane curve or of a surface in homogeneous and non- 
homogeneous coérdinates respectively. Such polynomials we will 
speak of as corresponding to one another according to the following 
definition: 


DEFINITION. Jf we have a non-homogeneous polynomial of the kth 
degree in any number of variables (a2, -++%,_,) and form a new poly- 
nomial by multiplying each term of the old by the power of a new 
variable x, necessary to bring up the degree of this term to k, the homo- 
geneous polynomial thus formed shall be said to correspond to the given 
non-homogeneous polynomial. 


Thus the two polynomials 
(1) 203 + 3a%y—dxe®— yz +22 +2 —B8y —9Y, 
(2) 203 + 8a°%y — Save — yet + 22t + 2? — 3 yt? — 96, 


correspond to each other. 

It may be noticed that if ¢ (2,, -+-2,_,) is the non-homogeneous 
polynomial of degree &, the corresponding homogeneous polynomial 
may be written 

y ak “1, %, .., fact), 

Ln Ln, Lp, 


To every non-homogeneous polynomial there corresponds one, and 
only one, homogeneous polynomial. Conversely, however, to a 
homogeneous polynomial in m variables there correspond in general 


SOME PROPERTIES OF POLYNOMIALS IN GENERAL iy 


m different non-homogeneous polynomials which are obtained by set- 
ting one of the variables equal to 1. For instance, in the example 
given above, to (2) corresponds not only (1) but also 


(3) 2 4+8y —5 2 —yet4+ 22%+2 —3y?— 98, 
(4) 208 + 322 —5ee2®—-2t +22t+e?—32 —98, 
(5) 2+ 3a27y—5e —yt +2t +a? —3y?— 98. 


It should be noticed, however, that if one of the variables enters into 
every term of a homogeneous polynomial, the result of setting this 
variable equal to unity is to give, not a corresponding non-homo- 
geneous polynomial, but a polynomial of lower degree. In fact, in 
the extreme case in which every variable enters into every term of 
the homogeneous polynomial, there is no corresponding non-homo- 
geneous polynomial; as, for instance, in the case of the polynomial 


xy + vy*2 + ye. 


THEOREM 1. If one of two corresponding polynomials is reducible, 
then the other is, also, and the factors of each polynomial correspond to 
the factors of the other. 


For let $(2,,---%,) be a homogeneous polynomial of degree 
(k+1), and suppose it can be factored into two factors of degrees 
k and J, respectively, 


(6) Piri(Ly + En) = We (Lp + Un) Xi(Lyy *** Ln)- 


Now suppose the corresponding non-homogeneous polynomial in 
question is the one formed by setting z,=1. We have 


(7) Dis (Ly °°* Erp 1) = Wa (S yp °° Snpy L) Xi(@y ++ Pas 1). 


Since by hypothesis the degree of the polynomial on the left is 
unchanged by this operation, neither of the factors on the right- 
hand side of (6) can have had its degree reduced, hence neither 
of the factors on the right of (7) is a constant. Our non-homo- 
geneous polynomial is therefore reducible; and moreover the two 
factors on the right of (7), being of degrees k and / respectively, 
are precisely the two functions corresponding to the two factors on 
the right of (6). 
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Now, let ®,,)(2}, ++ @-1) be a non-homogeneous polynomial, and 
suppose 
re Dy 41(%p 0 G4) = WG, +++ Un) Xj(2) +++ Ln-4)s 


where the subscripts denote the degrees of the polynomials. Let 
disn Vrir.Xex be the homogeneous polynomials corresponding to 
®,V,X. Then when xv, +0, 


a ie x oe ae, By 
Dyyi(S, Zt) = W, wee 21) X, ee 21), 
an ip Ln ts HE, ee 


Multiplying this equation by a7’ we have 





Pps (L; “Un 45 ig) = Wi, (a4, “* Un49 Bn) x1 (a1, “t+ Tn 49 Ts 


an equation which holds whenever x, + 0, and, therefore, by Theorem 
5, § 2, is an identity. Thus our theorem is proved. 

As a simple illustration of the way in which this theorem may be 
applied we mention the condition for reducibility of a non-homo- 
geneous quadratic polynomial in any number of variables. By 
applying the test of § 47 to the corresponding homogeneous poly- 
nomial we obtain at once a test for the reducibility of any non- 
homogeneous quadratic polynomial. 


THEOREM 2. Jf f and > are non-homogeneous polynomials, and 
F, ® are the corresponding homogeneous polynomials, a necessary and 
sufficient condition that F and ® be relatwely prime is that f and 
d be relatively prime. 


For if f and ¢ have a common factor w which is not a constant, 
the homogeneous polynomial VY which corresponds to W is, by 
Theorem 1, a common factor of # and ®, and is clearly not a con- 
stant. Conversely, if WY is a common factor of F and ® which is 
not a constant, f and ¢ will have, by Theorem 1, a common factor 
which corresponds to Y'and which therefore cannot be a mere 
constant. 


63. Division of Polynomials. We will consider first two polyno- 
mials in one variable: 
tee = ay 2 + O21 bo + ay 


(1) 
p(x) = byu™ + b,a™ 1+. + Oy. 
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We learn in elementary algebra how to divide f by ¢, getting a 
quotient @ (a) and a remainder R(x). What is essential here is 
contained in the following theorem: 


THEOREM 1. Jf f and $ are two polynomials in x of which ¢ is 
not identically zero, there exists one, and only one, pair of polynomials, 
Q and R, which satisfy the identity 
(2) F(z) = C(x) 6(@) + RO), 
and such that either R=0,* or the degree of R is less than the degree of g. 

We begin by proving that at least one pair of polynomials Q, R 
exists which satisfies the conditions of the theorem. 

If f is of lower degree than ¢ (or if f= 0), the truth of this state- 
ment is obvious, for we may then let 9=0, R=f. 

Suppose, then, that fis of at least as high degree as ¢. Writing 
f and ¢ in the form (1), we may assume 


a #0, 6540, n2=m. 
Lemma. Jf ¢ ts not of higher degree than f, there exist two polyno- 
mials Q, and R, which satisfy the identity 
F(x) = Q(x) (x) + Ry(2), 
_ and such that either R,=0, or the degree of R, is less than the degree of f. 


The truth of this lemma is obvious if we let 
Q,(2)= 9 nm | 
by 


These two polynomials Q, and F, will serve as the polynomials Q 
and # of our theorem if #,=0, or if the degree of A, is less than the 
degree of ¢. If not, apply the lemma again to the two functions R, 


and ¢, getting Ri @)= Q,(2) (2) + FR, (2), 
where R, is either identically zero or is of lower degree than R, We 
may then write, (7) =[Q, (2) + Q,(z)] (2) + Ry (2). 


If #,=0, or if the degree of A, is less than the degree of ¢, we may 
take for the polynomials Q, & of our theorem, Q,+ @, and R,. If 
not, we apply our lemma again to R, and ¢. Proceeding in this way 


* It will be remembered that, according to the definition we have adopted, a polyno- 
mial which vanishes identically has no degree. 
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we get a series of polynomials ,, R,, --- whose degrees are con- 
stantly decreasing. We therefore, after a certain number of steps, 
reach a polynomial &; which is either identically zero or of degree 
less than ¢. Combining the identities obtained up to this point, we 


have F(e)=([%@ +--+ O(z)] $(2) + & (a), 


an identity which proves the part of our theorem which states that 
at least one pair of polynomials Q, # of the kind described exists.* 

Suppose now that besides the polynomials Q, & of the theorem 
there existed a second pair of polynomials Q’, R’ satisfying the same 
conditions. Subtracting from (2) the similar identity involving 
Q', B', we have 
(3) 0=(Q-@) d+(R-R). 

From this we infer, as was to be proved, 

Q=, R=Rf’. 

For if Q@ and Q! were not identical, the first term on the right of (3) 
would be of at least the mth degree, while the second involves no 


power of a as high as m. 
Turning now to polynomials in several variables : 


(4 Pp (Xyy ++ By) HAg( yy -++ Vy T+ My (Lyq +++ Ve)ay? + +++ +Ag(yy +++ Le), 
P(X yp +++ Wy) = Oo Lay ++ Wy) Uy A Oy Wy +++ eT EH +++ + On( Fay +++ Le) 
the ordinary method of dividing f by ¢ would give us as quotient and 


remainder, not polynomials, but fractional rational functions. In 
order to avoid this, we state our theorem in the following form: 


THEOREM 2. If f and ¢ are polynomials in (2, --- x,) of which 
is not identically zero, there exist polynomials Q, R, P, of which the 
last is not rdentically zero and does not involve the variable x,, which 
satisfy the identity, 

(5) Ply -- my)f (oy + 4) = Olay > a) (ayy 24) + Ry -~ 44) 
and such that either R =0, or the degree in x, of R is less than the degree 
im x, of >. 

The proof of this theorem follows the same lines as the proof 

of Theorem 1. 


* The reader should notice that the process just used is merely the ordinary process 
of long division. 
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If f is of lower degree in x, than ¢ (or if f=0), the truth of the 
theorem is obvious, for we may then let P=1, Q=0, R=f. 

Suppose, then, that f is of at least as high degree in 2, as ¢. 
Writing f and ¢ in the form (4), we may assume 


G20, 0,40, n 2 m: 


Lemma. Jf ¢ ts not of higher degree in x, than f, there exist two 
polynomials Q,, R, which satisfy the identity, 


bo( aos soe 2) F(X ASE L,)= Q,(ay age Lz) P (ay, at 2) + R(x, 500 £p)s 


and such that either R,=0, or the degree of R, in x, is less than the 
degree of f in xy. 


The truth of this lemma is obvious if we let 
1 = (Lay ++ By) BI” 


The polynomials Q,, R,, 6, will serve as the polynomials Q, R, P 
of our theorem if #, =0, or if the degree of F, in 2, is less than the 
degree of dinz,. If not, apply the lemma again to the two functions 
hk, and ¢, getting 


By (Bay +++ Lp) Ry (yy +++ Ly) = Vo Bp +++ Te)H(@p +++ Te) + Relay, +++ Lp), 


where A, is either identically zero or is of lower degree in 2, than R,. 
We may then write b2f=(b)Q,+ Q,)> + Ry 


If R,=0, or if the degree of FR, in z, is less than the degree of ¢ in 
2,, we may take for the polynomials Q@, &, P of our theorem the 
functions 6,9,+ Q,, A, 62. If not, we apply our lemma again to R, 
and ¢. Proceeding in this way, we get a series of polynomials 
R,, R,, --- whose degrees in x, are constantly decreasing. We there- 
fore, after a certain number of steps, reach a polynomial &; which is 
either identically zero, or of degree in x, less than d. Combining 
the identities obtained up to this point, we have 


bo F = (05719, + b 20, Ns Q:)> + hi, 
an identity which proves our theorem, and which also establishes the 
additional result : 


CoroLLARyY. The polynomial P whose existence is stated in our 
theorem may be taken as a power of bp. 
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We note that it would obviously not be correct to add to the 
statement of Theorem 2 the further statement that there is only 
one set of polynomials %, &, P, since the identity (5) may be multi- 
plied by any polynomial in (# ---z,) without changing its form. 
Cf., however, the exercise at the end of § 73. 


64. A Special Transformation of a Polynomial. Suppose that 
A(@y)%9%q%,) is a homogeneous polynomial of the Ath degree in 
the homogeneous codrdinates (a, 2, V3, %,), So that the equation 
f = 0 represents a surface of the Ath degree. If, in f, the term in 
vt has the coefficient zero, the surface passes through the origin; 
and if the term in x# (or x%, or x) has the coefficient zero, {he 
surface passes through the point at infinity on the axis of a, 
(or 2, or %). It is clear that these peculiarities of the surface 
can be avoided, and that, too, in an infinite variety of ways, by sub- 
jecting the surface to a non-singular collineation which carries over 
any four non-complanar points, no one of which les on the surface, 
into the origin and the three points at infinity on the codrdinate 
axes. It is this fact, generalized to the case of n variables, which 
we now proceed to prove. 


Lemma. Jf f(x,,---%,) is a homogeneous polynomial of the kth 
degree in which the term wk, ts wanting, there exists a non-singular 
linear transformation of the variables (x,,---2,) which carries f into a 
new form f,, in which the term in al has a coefficient different from 
zero, while the coefficients of the kth powers of the other variables have 
not been changed. 


In proving this theorem there is obviously no real loss of gener- 
ality in taking as the variable z,, the last of the variables z,. 
Let us then consider the non-singular transformation 
x, =a + aga! (¢= 1, ---n—1). 


Fal 
This transformation carries f into 
(ee I ea ae i ! 
FQ MySay + ayy <a), + a, 30, 2G) 


and evidently does not change the coefficients of the terms in 


k k 
Dio Uh 
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Now, since every term in f,, except the term in 2", contains at least 


a of the variables 2/, --- 2/,_,, the coetticient of the term in 2’ will 
[3] £ 
FO OC, ate (ays dy, 1). 


Our lemma will therefore be proved if we can show that the 
constants a, ---a,_, can be so chosen that this quantity is not zero. 

Let us take any point (0,,---6,) for which 6,#0; and consider 
a neighborhood of this point sufficiently small so that 2, does 
not vanish at any point in this neighborhood. Then, since f 
does not vanish identically, we can find a point (¢,,+--¢,) in this 
neighborhood (so that ¢,+#0) such that 


Key +++ Cn) #9. 


If now we take for aj, ---a,_, the values ¢,/¢y +++ C,—1/Cn) We shall 
have, since f is homogeneous, 


Say “5 An_— 49 1) aN. 
Thus our lemma is proved. 


THEOREM 1. Jf f(2,,---%,) is a homogeneous polynomial of the kth 
degree, there exists a non-singular linear transformation which carries 
F into a new form f, in which the terms in ws ve gl. all have coefficients 
different from zero. 


The proof of this theorem follows at once from the preceding 
lemma. For we need merely to perform in succession the trans- 
formations which cause the coefficients first of at, then of 2%, etc., 
to become different from zero, and which our lemma assures us 
will exist and be non-singular, to obtain the transformation we 
want. To make sure of this it is necessary merely to notice that 
the coefficient of af obtained by the first transformation will not 
be changed by the subsequent transformations; that the same 
will be true of the coefficient of xk obtained by the second trans- 
formation ; etc. 


THEOREM 2. If f(x,, --- %,) is a polynomial of the kth degree 
which is not necessarily homogeneous, there exists a non-singular 
homogeneous linear transformation of (a, +++ %) which makes this 
polynomial of the kth degree in each of the variables x}, --- x}, taken 
separately. 
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If f is homogeneous, this is equivalent to Theorem 1. If f is 
non-homogeneous, we may write it in the form 


S@yp ar Un) = $y (Ly . im) Pi-1 (Lp BS Tn) ee + $1 (4%; ite Ly) + dos 


where each ¢ is a homogeneous polynomial of the degree indicated 
by its subscript or else is identically zero. We need now merely to 
apply Theorem 1 to the homogeneous polynomial ¢,, which is, of 
course, not identically zero. 

This theorem, and therefore also Theorem 1, which is merely a 
special case of it, admits the following generalization to the case ofa 
system of functions: 


THEOREM 3. If we have a system of polynomials 
VG@at? flap sae Fri@yy “+ Bq)s 


of degrees k,, ky, +++ km respectively, there exists a non-singular homo- 
geneous linear transformation which makes these polynomials of degrees 
key +++ Kem tn each of the variables x}, --- x}, taken separately. 


This theorem may be proved either by the same method used in 
proving Theorems 1, 2; or by applying Theorem 2 to the product 


Site “+ Sine 


CHAPTER XV 


FACTORS AND COMMON FACTORS OF POLYNOMIALS IN ONE 
VARIABLE AND OF BINARY FORMS 


65. Fundamental Theorems on the Factoring of Polynomials in 
One Variable and of Binary Forms. Theorem 2, § 6 may be stated 
in the following form: 


THEOREM 1. A polynomial of the nth degree in one variable is 
always reducible when n>1. It can be resolved into the product of n 
linear factors in one, and essentially in only one, way. 


By means of § 62 we can deduce from this a similar theorem in 
the case of the binary form 


(1) Ayr + axe lay + ++ + A,2h. 


Let us first assume that a,#+0. Then the non-homogeneous poly- 
nomial 
(2) Aye + ayeT tb os + My 
corresponds to (1), according to the definition of §62. Factoring 
(2), we get 

(4, — %4)(y — tty) +++ (% — Oy), 
or, if we take n constants /, aj, --- # whose product is dp, 


(3) (alfa, — a (abla — 0) ++ (aint, — Oy )s 


where for brevity we have written 


oa, = of (= 1,2) +». 2). 


By Theorem 1, § 62, we now infer that the binary form (1) is identi- 
cally equal to 


(4) (ety — 04% q) (Oy — Hy) +++ (yey — Mya). 
187 
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Moreover, there cannot be any way essentially different from this of 
factoring (1) into linear factors, for if there were we should, by 
setting v, = 1, have a way of factoring (2) into linear factors essen- 
tially different from (3). Thus our theorem is proved on the suppo- 
sition that a) + 0. 

Turning now to the case a, = 0, let us suppose that 


a SHS SHS ay= 0, Ap 0, 


where k < n. The form (1) then has the form 


-k,k 
(5) aya kak 4 e+ 4 AnXh, 


which is equal to the product of & factors x, and the binary form 


n—k mn — ke 
yy AR = a, 


of degree n —k. Since the first coefficient in this form is not zero, 
it can, as we have just seen, be factored into  — & linear factors. 
Thus, here also, we see that the binary form can be written in the 
form (4), the only peculiarity being that in this case & of the con- 
stants «!’ are zero. We leave it to the reader to show that this 
factoring can be performed in essentially only one way. This being 
done, we have the result: 


THEOREM 2. A binary form of the nth degree is always reduc- 
ible when n>1. It can be resolved into the product of n linear 
factors in one, and essentially only one, way. 


EXERCISES 


1. Prove that every real polynomial in one variable of degree higher than two 
is reducible in the domain of reals, and can be resolved into irreducible factors in 
one, and essentially only one, way. 


2. Prove the corresponding theorem for real binary forms. 


66. The Greatest Common Divisor of Positive Integers.* We 
will consider in this section the problem of finding the greatest com- 
mon divisor of two positive integers a and 6, which has the closest 


* In this section we use the term divisor in the arithmetical sense, not in the 
algebraic sense defined in §60, An integer 0 is said to be a divisor of an integer a if 
an integer c exists such that a = be. 
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analogy with the algebraic problem of the next section. The solu- 
tion of this problem, which was given by Euclid, is as follows: 

If we divide a by 6* and get a quotient g) and a remainder 7,, we 
may write a=qb+ryp 
where, if the division is carried out as far as possible, we have 7,<0. 

Then divide 6 by 7, getting a quotient g, and a remainder 7, 
which, if the division is carried out as far as possible, is less than 7. 
Proceeding in this way, we get the following system of equations, 
in which, since the remainders 7, 7,, --- are positive integers which 
continually decrease, we ultimately come to a point where the divi-. 
sion leaves no remainder : 


a=qo+r Pio; 
6b = "4 + Up) oe <a an 
= Glo 13 <7 
3 2) 
(1) ‘esas 
To-2 = Yp-1% 9-1 + 1p "p< To-13 
"p-1 = Up" 0< 1. 


From the first of these equations we see that every common factor 
of a and 6 is a factor of r,; from the second, that every common 
factor of 6 and 7, is a factor of r,; etc.; finally, that every common 
factor of r,_, and r,_, is a factor of 7,. Hence every common factor 
of a and b is a factor of r,. 

On the other hand, we see from the last equation (1) that every 
factor of 7, is a factor of 7,_,; from the next to the last equation, that 
every common factor of r, and r,_, is a factor of 7,_,; etc.; finally, 
that every common factor of r, and r, is a factor of b, and that 
every common factor of r, and 6 is a factor of a. Hence every factor 
of r, 1s a common factor of a and 6. 

Since the largest factor of r, is r, itself, we have the result : 


THEOREM 1. In Euclid’s algorithm (1), the greatest common divisor 
of a and b is r,. 

In particular, a necessary and sufficient condition that a and 6 be 
relatively prime is that r,= 1. 


* This is possible even if a<b, the only peculiarity in this case being that the 
' quotient is zero and the remainder equal to a. 
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We will next deduce from the equations (1) an important formula 
by means of which 1, is expressed in terms of a, 6, and the q’s. 
From the first equation (1) we have 


1p =a—qQ). 
Substituting this value into the second equation, we get for r, 
the value = — H44+(HH +16. 
Substituting the values for 7, and rv, just found in the third 
equation, we Beh 74 — (q49. + 1) 4 — (quar + % + %)>- 
Proceeding in this way, we can express each of the 7’s, and there- 
fore ultimately 7,, in terms of a and b. In order to express con- 


veniently the general formula here, we introduce the following 
notation : 


[]J=1! 
[e,] = 4, 
[4.0%] = 04+ I, 
(2) [4 gy Og] = Oy tty eg + Oy + Oy, 


[¢,, aye a, | = [ a, diss Op—y | _ + [5 aes 9 |< 


It will be seen that the values of 7,, 7,, 7, found above are included 
in the formula 


(3) 1 = (— 1S [On das + Gea) @ + (— 1 [905 91 Gas *** G11 - 


By the method of mathematical induction this formula will therefore 
be established for all values of k<p if, assuming that it holds when 
kSk,, we can show that it holds when k=k,+1. This follows at 
once when, in the formula 


Tre = h-1 ~ Vey ky? 


we substitute for 7, and r,,_, their values from (3) and reduce the 
resulting expression by means of the definitions (2). 
We have therefore established the formula 


(4) 1, = Aa+ Bb, 
where A=(— pes [9 Io" 9-1], B=(— 1? [% wy %-1]- 


Since the q’s are integers, it is clear that A and B will be integers. 


—— 
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The most important application of the result just obtained is to 
the case in which a and 6 are relatively prime. Here r,=1, and we 
have ‘ 

(5) Aa+ Bb=1. 


Conversely, if two integers A and B exist which satisfy (5), a and 0 
are relatively prime, as otherwise the left-hand side of (5) would 
have a factor greater than 1. 


THEOREM 2. A necessary and sufficient condition for a and 6 to be 


relatively prime is that there exist two integers A and B such that 
Aa+ Bb=1. 


EXERCISES 
1. Prove that [01, @2, +++ Gn] = [On G@n—1, +++ 01]. 
[Suecestion. Use the method of mathematical induction.] 
2. Prove that the numerical values of the integers A and B found above are 
respectively less than 4) and ia. 
[Suecestion. Show that a/b =[q -+-9]/[qu +++ 4], and that this second 


fraction is expressed in its lowest terms. ] 


3. Prove that there can exist only one pair of integers A and B satisfying the 
relation Aa + bb =1 and such that A and B are numerically less than 45 and ia 
respectively. 


67. The Greatest Common Divisor of Two Polynomials in One 
Variable. In place of the integers a and b of the last section, we 
consider here the two polynomials: 


= nv n—1 Ace - 
(1) F(x) = au” + az” 14+ ++ ta, 


p(x) = byx™ + ba 14 + +5, 


By the greatest common divisor of these two polynomials is 
meant their common factor of greatest degree.* It will turn out in 
the course of our work that (except in the case in which f and ¢ are 
both identically zero) this greatest common divisor is completely 
determinate except for an arbitrary constant factor which may be 
introduced into it. 


* Many English and American text-books use the term highest common factor ; but 
as there is not the slightest possibility that the word greatest, here, should refer to the 
value of the polynomial, since the polynomial has an infinite number of values for dif- 
ferent values of the argument, it seems better to retain the traditional term. 
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We will assume that neither f nor ¢ is a mere constant, and that 
the notation has been so introduced that f is of at least as high 
degree as ¢; that is, we assume 

#0, #9, n>m>%. 


Let us apply Euclid’s algorithm to f and precisely as in § 66 we 
applied it to aand 6. We thus get the system of identities 


f (a) = Q(z) p(x) + Ka), 
(x)= Q(x) R(x) + Bz), 
(2) R,(x)= 0,(2) R(x) + B(2), 


Ei, -1(*) = Q(x) R(x) os Rost 
For the sake of uniformity we will write 
$(x) = (x). 
Then R,, R,, R,, are polynomials of decreasing degrees, so that 
‘after a finite number of steps a remainder is reached which is a con- 
stant. This remainder we have indicated by #,.4. 

From this algorithm we infer, as in $66, that every common 
factor of f and ¢ is a factor of all the F’s, and, on the other hand, 
that every common factor of two successive F’s is a factor of all the 
preceding F’s and therefore of f and ¢. Accordingly, if fand ¢ 
have a common factor which is not a constant, this common factor 
must be a factor of the constant #, ,,, and therefore R,,,=0. Con- 
versely, if #,,,= 0, the polynomial #,(x) is itself a common factor 
of R, and #,,,, and therefore of fand ¢. Hence the two theorems: 

THEOREM 1. A necessary and sufficient condition that two poly- 
nomials in one variable f and do, neither of which is a constant, be 
relatively prime is that in Euclid’s algorithm, (2), Ry. #0. 


THEOREM 2. Jf in Euclid’s algorithm, (2), R,., =, then R(x) is 
the greatest common divisor of f and ¢. 

By means of this theorem we are in a position to compute the 
greatest common divisor, not only of two, but of any finite number, of 
polynomials in one variable. Thus if we want the greatest common 
divisor of f(x), d(x), W(x), we should first compute, as above, the 
greatest common divisor R(x) of f and ¢, and then, by the same 
method, compute the greatest common divisor of R(x) and (2). 
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From the identities (2) we can compute the value of each of the 
remainders in terms of f, ¢, and the quotients Y. The formule 
here are precisely like those of § 66, and give for R,,, the value 
(3) Fon =(— VPC) @(2) + Oe) F@) 

+(— Tt [Qe U@), - Ce) 1G) 

Suppose, now, that f and ¢ are relatively prime. We may then 
divide (3) by #,., and get 
(4) Fa)yf _ a P(x) p(w) = 1, 


where 
(5) i 2)= Fe ee) O{2), -0,(2)| 





a(a)='5, ty —| aula Qa) @4C2)]. 


From the definitions (2), : i" we see that # and ® are polyno- 
mials in z. The existence of two polynomials F and ® which satisfy 
(4) is therefore a necessary condition that f and ¢ be relatively prime. 
It is also a sufficient condition ; for from (4) we see that every com- 
mon factor of fand ¢ must be a factor of 1, that is, must be a con- 
stant. Thus we have proved the theorem: 


THEOREM 3. A necessary and sufficient condition that the polyno- 
mials f(a) and $(x) be relatively prime is that two polynomials F(x) and 
D(x) exist which satisfy (4).* 

We can make this statement a little more precise by noting the 
degrees of F and ® as given by (5). For this purpose let us first 
notice that if %,---%, are polynomials of degrees k,, ---k, respec- 
tively, [%,, ---%,] will, by (2), § 66, not be of degree greater than 
ki +++ +p. Now let the degree of R(x) be m, and, as above, the 
degrees of f and ¢, n and m respectively. Then (cf. (2)) the degrees 
of Q 91 Ga °*: will be n— m, m— m,, m, —M,, «-- respectively. Ac- 
cordingly, by (5), the degrees of # and ® are respectively not greater 


than (m — m1) + (Mm, — My) + + +(M,-14 — M) = M— Mz, 


and (m —m) + (m — my) + (am, — my) + ++ + (my — Mp) = M— My. 
Hence, since m, >0, # is of degree less than m, and ® of degree less 
than n. 


* The proof we have given of this theorem applies only when neither fnor ¢ is a 
constant. The truth of the theorem is at once obvious if f or ¢ is a constant. 
Co) 
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Conversely, we will now show that if #, is a polynomial of degree 
less than m, and ®, a polynomial of degree less than n, and if 


(6) E(x) f(x) + ®(x) pr) = 1, 
then Ei(c)=F(e), (x)= (2). 


To prove this, subtract (6) from (4), getting 
(F— F,)f=(P,—®)6. 


If we resolve the two sides of this identity into their linear fac- 
tors, we see that, since f and ¢ are relatively prime, f must be a 
factor of ®, — ® and ¢ a factor of F—F,. This, however, is pos- 
_ sible only if @,—® and F—F, vanish identically, as otherwise 
they would be of lower degree than f and ¢ respectively. ~-We 
have thus proved the theorem : 


TueroreM 4. Jf f(x) and ¢(2) are relatively prime, and neither is 
a constant, there exists one, and only one, pair of polynomials F(x) 
P(x), whose degrees are respectively less than the degrees of and f, 
and which satisfy the identity (4). 

Before proceeding to the general applications of the principles 
here developed which will be found in the next section, the reader 
will do well to familiarize himself somewhat with the ideas involved 
by considering the special case of two polynomials of the second 
degree: Se) % 2+ a, 2+ a, dy 05 

(x) = by) a+ b,a+ 6, ees 

If the condition that these two polynomials be relatively prime 
be worked out by a direct application of Euclid’s algorithm, it 
will be found necessary to consider separately the cases in which 
a,b) — ab, is or is not zero. By collating these results it will be 
found that in all cases the desired condition is: 

(259 — gba)? + (442q — Agbs )(ayby — Ayb4) #0. 
This condition may be found more neatly and quickly by obtaining 
the condition that two polynomials of the form 
B(x)= pot + Py 
Px) qrt+H 
exist which satisfy the identity (4). 


It is this last method which we shall apply to the general case in 
the next section. 
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68. The Resultant of Two Polynomials in One Variable. Let 
F(@) = ax” + aya" + «- +a,v+ a, a #9, n>0, 
(x) = byu™ + bya™ 14. +O, eo + by b= 0, nv > 0. 
The condition that these polynomials be relatively prime consists, 
as we see from Theorem 4, § 67, in the existence of constants pp, 
Bie 1 Yor Ga =< Gna SUCH that 
(poe + pie 2+ -* + Pm hat? + yt" + + + Gy) 
SAG Gye Gan Oe tO pes Og EL 
Equating coefficients of like powers of x, we see that the following 
system of equations is equivalent to the last written identity: 





[a Po + 9% = 
“Po i “oP 1 an Pro y fon = 
InPo = Am—1P 1 ny i ae al fe bata oe tte a fe = 
\) Gn+1Po i An P41 aaa Ay Pm—1 Be aes 5 a ae aoe = Pod m+1 = 
an Po ae U1 Py pbcoorin Bn—m+1Pm-1 + ae —m uA i Svuep ie ae ‘i = 0, 
On Py ++ + On—mi2Pm-1 + Pin Saal ee Pra = =), 
L An Pm-1 eres — =1. 


In writing these equations we have assumed for the sake of 
definiteness that n 2m, though the change would be immaterial if 
this were not the case. This is a system of m+n linear equations 
in the m+n unknowns Pp, -*- Pm Yo *** An—1» Whose determinant, after 
an interchange of rows and columns and a shifting of the rows, is 


ON ee | ICL SE Ne ot i on en 

CE nee aN Lies os Mt: oo) ly Os gm amen 

Ap» An, 0 0 a, An 
BG” ")= 0 Oe 0 By aes, 

0 0 dy bi Pai 


Ve 0s. 0 
a determinant which, it should be noticed, has m+n rows and columns. 
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If R#0, the set of equations (1) has one and only one solution, 
and f and ¢ are relatively prime. If R=0, two cases seem at first 
sight possible (cf. § 16): either the system of equations has no solu- 
tion, or it has an infinite number of solutions. This latter alterna- 
tive cannot, however, really arise, for we have seen in Theorem 4, 
§ 67, that not more than one pair of polynomials # and ® can exist 
which satisfy formula (4) of that section and whose degrees do not 
exceed m—1 and n—1 respectively. Accordingly, if R=0, the 
set of equations has no solution and f and ¢ have a common factor. 

RB is called the resultant of f and $.* 

The term resultant has thus been defined only on the Sonne 
that f and ¢ are both of at least the first degree. It is desirable to 
extend this definition to the case in which one or both of these 
polynomials is a constant. Except in the extreme case m=n= 0, 
we will continue to use the determinant # as the definition of the 
resultant. Thus when m=0, n>0 we have 


RCO TIE n(n—1) a 
R( j=) zon, 


If 6,#0 we have ##0, and moreover in this case f and ¢ are 
relatively prime since the constant ¢@ has no factors other than 
constants. If, however, 6)=0, we have &=0, and every factor of 
Ff is a factor of ¢, since d is now identically zero. 

Similarly when n= 0, m> 0, we have 


a(p.,J=# 
Boy ++ Om 0 


and we see that a necessary and sufficient condition that f and ¢ be 
relatively prime is that R#0. 

Finally, when n= m= 0, we define the symbol R (3°) which we 
still use to denote the resultant, by the formula b 

R (0) a i when a, and 4, are not both zero, 
by 0 when ¢,=—6,= 0. 

We may now say with entire generality: 

THEOREM. A necessary and sufficient condition for two polynomials 
im one variable to be relatively prime is that their resultant do not vanish. 

For another method of approach to the resultant, cf. Exercise 4 
at the end of § 76. 


*Itshould be noticed that the resultant of ¢ and f may be the negative of the 
resultant of fand ¢. This change of sign is of no consequence for most purposes. 
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69. The Greatest Common Divisor in Determinant Form. 

DEFINITION. By the ith subresultant R,; of two polynomials in 
ene variable is understood the determinant obtained by striking out the 
jirst t and the last i rows and also the first i and the last ¢ columns from 
the resultant of these polynomials. 

Thus if the polynomials are of degrees 5 and 3 respectively, the 
resultant & is a determinant of the eighth order, R, of the sixth, 
fi, of the fourth, and R, of the second, as indicated below : 


My ay Ay as Uy Or 0 0 





We now state the following results, leaving their proof to the 
reader: 


Lemma. Jf f(x) and $,(x) are polynomials, and 
(a= (a@-a) f(z), Pw) =(e@— «)Pi(2), 


the resultant of f, and >, and their successive subresultants are equal 
respectively to the successive subresultants of f and >. 


THEOREM 1. The degree of the greatest common divisor of f(x) 
and (2) is equal to the subscript of the first of the subresultants 
R,=R, Ry Ry, -- which does not vanish. 


THEOREM 2. Jf i is the degree of the greatest common divisor of 
two polynomials f(x) and p(x), then this greatest common divisor may 
be obtained from the ith subresultant of f and ¢ by replacing the last 
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element in the last row of coefficients of f by f(x), the element just above 
this by wf(x), the element above this by a f(a), ete.; and replacing 
the last element in the first row of coefficients of > by p(x), the element 
below this by xd(x), ete. 

If, for instance, the degrees of f and ¢ are 5 and 3 respectively, 
and i= 1, the greatest common divisor is 


My Ay Ay, as a, xf (x) 

0 oN ay Ay, as F(2) 

0 0 0 by b, (x) 

0 0 by b, by ap(x) | 

0 by b, by bs x(x) 

by b, by b, 0 239(2) 
70. Common Roots of Equations. Elimination. Consider the 
equations f(x) = aye" + az"14--+a,=0 Oy #0, 
f(x) = byx™ + bya" 14 --- +6, = 0 b,+#9, 


whose roots are @,, @,---@, and By, By, ---B,», respectively; and 
suppose f(z) and ¢(~) resolved into their linear factors: 


S(®) = aq (@ — 0% w — Oy) ++ (@— Oy), 

(x) = bw — By)(w — By) — Bm) 
Since, by Theorem 1, § 65, these sets of -factors are unique, it is 
evident that the equations f(7)= 0 and ¢(z)= 0 will have a common 
root when, and only when, f(z) and ¢(z) have a common factor, 
that is, when, and only when, the resultant R of f and ¢ is zero. 

To eliminate x between two equations f(v)=0 and ¢(x)=90, is 
often taken in elementary algebra to mean: to find a relation between 
the coefficients of f and ¢ which must hold if the two equations are 
both satisfied; that is, to find a necessary condition for the two 
equations to have a common root. For most purposes, however, 
when we eliminate we want a relation between the coefficients which 
not only holds when the two equations have a common root, but 
such that, conversely, when it holds the equations will have a 
common root. From this broader point of view, to eliminate 2 
between two equations f(7)=0 and ¢(2)=0 means simply to find 
a necessary and sufficient condition that these equations have a 
common root. Hence the result of this elimination is R=0. Let 
us, however, look at this question from a little different point of view. 
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In the equations 
(1) Aye + aya* + agv? + ase? + aye + a, = 0 ay #0, 
(2) box? + bia? + b,x + b, = 0 6,0, 
let us consider the different powers of 2 as so many distinct un- 
knowns. We have, then, two non-homogeneous, linear equations in 
the five unknowns 2, 2”, 2%, z*, 2. Multiplying (1) through by x 
and then by 2%, and (2) by a, 2’, 2, x‘, in turn, we have 
Ayu! + a,x + aga? + agrt + ax? + a2? =; 
Ayu + a,x + art + ag? + ayn? + age = 0, 
Aye? + 4,24 + anv? + age +aae+a, =0, 
boa? + 6,27 + b,2+6, =0, 


boat + ba? + 6,27 + bax = 0, 

bx? + 0,24 + 6,23 + baa? = 0, 

628 + 6,25 + b,04 + bax8 =; 

box? + 6,0° + bya? + bgr* = 0; 


a system of eight non-homogeneous, linear equations in seven 
unknowns. 

If a value of ~ satisfies both (1) and (2), it will evidently satisfy 
all the above equations. These equations are therefore consistent, 
so that by Theorem 1, § 16, we have. 


ay ay Ay ag U4 As 0 0 
0 ay ay Ay As a, a7 
0 0 My ay An, As a, as 
0 0 0 0 by b, by. 270g. | 0 
0 0 0 Dre hs tee 0 ; 
0 0 by b, by bs 0 0 
0 by b, b, bs 0 0 0 


by b, by bs 0 0 0 0 
Hence the vanishing of this determinant is a necessary condition for 
(1) and (2) to have a common root. 
This device is known as Sylvester's Dialytic Method of Elimina- 
tion.* 


* For the sake of simplicity we have taken the special case where n = 5 and m=3. 
The method, however, is perfectly general. 
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The above determinant is seen to be exactly the same as the 
resultant R of (1) and (2), so that Sylvester’s method leads to the 
same condition for two equations to have a common root as that 
found above, namely R=0. It does not prove, however, that this 
condition is sufficient, but merely that it is necessary. Thus Syl- 
vester’s method, while brief, is very imperfect. 

The number of roots common to two equations, f(7)=0 and 
¢(x)= 0, and also an equation for computing the common roots, 
may be obtained at once from § 69. 


71. The Cases ay=0 and 6,=0. It is important for us to note 
that according to the definitions we have given, the determinant 


R a "*@n\ will be the resultant of the two polynomials 
0° m f (2) = yx” + aa Be cae + Any 

f(z) = bya” + bya” 14 --- +b, 
only when f and ¢ are precisely of degrees n and m respectively, 
that is, only when ay#0, 6,40. Thus, for instance, the resultant of 
the polynomials F(a) = a2" + a2 2 4 + dy 

$(a) = yu +b,a™24 + by : 
is not the (m+n)-rowed determinant R(? ae) but, if a,+0, 

eg 

6) #0, the (m+n—1)-rowed determinant R & ig =) or, if a, or by 
is zero, a determinant of still lower order.* a 


Let us indicate by & the (m+ n)-rowed determinant A Ge tas pe 
0° eisse m 


and by r the resultant of f and ¢, and consider the case ay=0, 
a,#0,b)#0. Since every element of the first column of & except 
the last is zero, we may write 

Oi (oe aaa, J 
In a similar way we see that if the degree of f is n —7, and 6,+0, we 


may write R= + bir, 
and if the degree of ¢ is m— 2, and a, #0, we have 
R= ajr. 


Accordingly, except when a,=},=0, R differs from r only by a 
non-vanishing factor. 
* As an illustration take the two polynomials f(#)=(a+6)a?+%—£ and 


¢(@)=ar+1. If a+840 and a0, the resultant here is (a2—1)f. But if 
a=— 6-0, the resultant is 1 —a?. 
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THEOREM. Although R oe ~ A ") is the resultant of f and o only 
agen be 
when ay+9 and b,#0 (or when m=0 or n=0), nevertheless its 
vanishing still forms a necessary and sufficient condition that f and > 
have a common factor even when ay=9 or b)=0, provided merely that 
both ay and by are not zero. 


That this last restriction can not be removed is at once evident; 
for, if ay=b)=0, every element in the first column of the determinant 
is zero, and hence the determinant vanishes irrespective of whether 
fand ¢ have a common factor or not.* All that we can say, if we 
do not wish to make this exception, is, therefore, that in all cases the 
vanishing of # forms a necessary condition that f and @ have a 
common factor. 


72. The Resultant of Two Binary Forms. Let us now consider 
the binary forms 


F(@y, Ly) = Ay UF + A272 + ++ + A, aR (n21) 
P(Xy, Ly) = boat + bya zy + ++ +b,,0% (m21). 
By the side of these forms we write the polynomials in one 
variable F(z) = ag2* + a,x" 4 + py 
D(x) = bya" + bya™ 14+ - +O,. 
The determinant Re ¥ ‘a 
eae 


will be the resultant of # and ® only when neither a, nor 0, is zero. 
We will, however, call it the resultant of the binary forms f and ¢ 
in all cases. 


* By looking at the question from the side of the theory of common roots of two 
equations (cf. § 70), and by introducing the conception of infinite roots, we may avoid 
even this last exception. An equation 

At” + ayu"-14 «+4, =0 
has n roots, distinct or coincident, provided a0. If we allow a, to approach the value 
zero, one or more of these roots becomes in absolute value larger and larger, as is seen 
by the transformation #! =1/a. Hence it is natural to say that if a) =0 the equation 
has an infinite root. If then we consider two equations each of which has an infinite 
root as having a common root, we may say: 

A necessary and sufficient condition that the equations 

MX” + AH") + + + On =0 n>0, 
beam + bye™ 14 oe +.B = 0 m > 0> 


GaN Ey oe: Uy *** On 
have a common root is in all cases the vanishing of R fe b it 
: 0? eee ‘m 
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THEOREM. A necessary and sufficient condition for two binary 
forms to have a common factor other than a constant is that ther 
resultant be zero. 

If a) and 4, are both different from zero, the non-homogeneous 
polynomials / and ® correspond to the forms f and ¢ according to 
the definition of § 62. Accordingly, by Theorem 2 of that section, 
a necessary and sufficient condition that f and ¢ have a common 
factor other than a constant is, in this case, the vanishing of their 
resultant. 

On the other hand, if a) =6,=0, f and ¢ have the common factor 
v, and the resultant of f and ¢ obviously vanishes. 

A similar remark applies to the case in which all the a’s or-all 
the 6’s are zero. 

There remain then only the following two cases to be considered, 
(1) A203 6) 0, = ee == 070, 0 (k<m), 
(2) b, 05 A =a= + =¢4,=0, Ans #9 (k<n). 


In Case (1), F corresponds to f, and, if we write 
P(X Vy) = 1p Py(Ly 7), 


® corresponds to ¢,. Now we know in this case (cf. § 71) that 
R+#0 is a necessary and sufficient condition that F and ® be rela- 
tively prime. Accordingly, by Theorem 2, § 62, it is alsoa necessary 
and sufficient condition that f and ¢, be relatively prime. But since 
@, is not a factor of f, the two forms f and ¢ will be relatively prime 
when and only when f and ¢, are relatively prime. Thus our theo- 
rem is proved in this case. 
The proof in Case (2) is precisely similar to that just given. 


CHAPTER XVI 
FACTORS OF POLYNOMIALS IN TWO OR MORE VARIABLES 


73. Factors Involving only One Variable of Polynomials in Two 
Variables. We have seen in the last chapter that polynomials in 
one variable are always reducible when they are of degree higher 
than the first. Polynomials in two, or more, variables are, in gen- 
eral, not reducible, as we have already noticed in the special case of 
quadratic forms. 

Let f(x, y) be any polynomial in two variables, and suppose it 
arranged according to powers of 2, 


SB YH M(Y)U" + YO TA oe Fn (Y)% + ALY), 
the a’s being polynomials in y. 
THEOREM 1. A necessary and sufficient condition that a poly- 


nomial in y alone, Wy), be a factor of f(x, y) is that it be a factor of 
all the a’s. 


The condition is clearly sufficient. To prove that it is necessary, 
let us suppose that y(y) isa factor of f(a, y). Then 


GQ) a(y)ar+ + + aly) = Wy) oo(y)2” + + +8,(9)], 


where the 6’s are polynomials in y. For any particular value of y 
we deduce from (1), which is then an identity in a, the following 


equations : ay(y) = v(y)bo(y)> 
a(y) = (yb) 
an(y) = ¥(y) Oy). 


Since these equations hold for every value of y, they are identities, 


and w(y) is a factor of all the a’s. 
: 203 
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TuroreM 2. Jf f(a, y) and ¢(a, y) are any two polynomials in x 
and y, and Wy) is an irreducible polynomial in y alone* which 18 a 
factor of the product fp, then is a factor of f or of >. 

Let f(a, y)=ag(y)a* + Gyan t+ + any) 
and P(t, y) = Oo(y)a™ + By(y)am—E Ho + Om(Y)s 
then 

F(@ YY PBs Y) = Ay bya"*™ + (yb, + Ady arr 
+ (diybg + Ab, + Aqb) art —2 + oe + Oy Dy 


In order to prove that w is a factor either of f or of ¢ we must 
prove that it is either a factor of all the a’s or of all the 6’s._ If this 
were not the case, we could find a first a in the sequence dp, a, ---a, 
of which wis not a factor. Call this function a; There would also 
be a first 6 in the sequence of 8), ,, --- 4, which is not divisible by wp. 
Call this function 6,. Our theorem will be proved if we can show 
that this assumption, that a; and 0, are not divisible by y while all 
the functions dp, --- a;_ 4, 09, +: ;_4, are divisible by y, leads to a 
contradiction. For this purpose let us consider in the product f¢ 
the coefficient of e—®*+™—9, which may be written 


Oy Diageo vee pg Op + gd; + ing By Hoos + Ain 7 Dq, 


provided we agree that the a’s and 6’s with subscripts greater than 
m and m respectively shall be identically zero. Since f¢ is by 
hypothesis divisible by y, it follows from Theorem 1 that the last 
written expression must be divisible by wy. This being obviously 
the case for all the terms which preceed and for all which succeed 
the term a,),, it follows that this term must also be divisible by yw, 
so that among the linear factors of the function a,6; must be found 
vy. But by Theorem 1, § 65, the function a6; can be resolved into 
its linear factors in essentially only one way, and one way of so re- 
solving it is to resolve a; and 6, into their linear factors. Since yw is 
not one of these factors, we are led to a contradiction, and our 
theorem is proved. 
An important corollary of our theorem is: 


CorotLaRy. Let f(x,y) and ¢$ (a, y) be polynomials in (a, y), and 
let W(y) be a polynomial in y alone. If wy is a factor of the product of 
Sh but is relatively prime to $, then w is a factor of f. 


* That is, a linear polynomial. . 
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If y is irreducible, this corollary is identical with the theorem. 
Let us suppose y resolved into its irreducible factors none of which 
are constants, that is, into its factors of the first degree: 


WY) = VY) Vay) > Wily): 
Now consider the identity which expresses the fact that is a factor 
of fo: 
(2) S(t YPM Y= VAYVAY) - VLYVE™ Y). 
This shows that y,(y) is a factor of f¢ and hence, by Theorem 2, 


it is a factor either of f or of ¢. Since p and ¢ are relatively 
prime, y, cannot be a factor of d. It must then be a factor of f: 


S(®% YV=VIQMA Y)- 


Substituting this in (2), and cancelling out y,, as we have a right to 
do since it is not identically zero, we get 


(3) Fs YP YV=WLY) - VLYV HC Y)- 
From this we infer that W,, being a factor of f,¢, must be a factor 
a: Alar = vhahles 9) 


We substitute this in (8) and cancel out W,. Proceeding in this way 
t 
me F(t Y= VAIVAY)  VUDLL N= VYLAE 9) 


an identity which proves our corollary. 


EXERCISE : 
If f(z, y) and (2, y) are polynomials, then any two sets of polynomials 


Py), 1% 9), Ry, y)s 
Py), Q2(% y), Rp(2, y), 
will be proportional to each other provided, 
(a) they satisfy the identities 
PL(YI@ YN=A@ YO y) +B 9)s 
PLY)S (HY) = W(t, WP Y) + Ro(* y); 
(b) there is no factor other than a constant common to Pj, Qi, and also no 


factor other than a constant common to P,, Q,; 
(c) R, and R, are both of lower degree in x than ¢. 


(Cf. Theorem 2, § 63.) 
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74. The Algorithm of the Greatest Common Divisor for Polyno- 


mials in Two Variables. We will consider the two polynomials in 


eae: f(a, y) = aly )e” + ayo" + + + aly), 


P(x, y) Hoya” + bi(y ur + + ony) 

and assume a, #0, 6,#0, n= m>0. 

Theorem 1 of the last section in combination with the results of 
§ 67 enables us to get all the common factors of fand ¢ which in- 
volve y only; for such factors must be common factors of all the a’s 
and all the 0’s. 

It remains, then, merely to devise a method of obtaining the com- 
mon factors of f and ¢ which do not themselves contain factors in y 
alone. We will show how this can be done by means of the algo- 


rithm of the greatest common divisor. 
Dividing f by ¢ (cf. § 63, Theorem 2), we get the identity 


PYF (a Y=QWa YH y) + R(x, 9). 


when R, is either identically zero, or is of lower degree in z than ¢. 
If Rk, #9, divide ¢ by F,, getting the identity 


Py) $a, Y)= O(a, Y)Ra, y) + Ria, y), 


where R, is either identically zero, or is of: lower degree in 2 than 
R,. It &, #0, divide R, by R,. Proceeding in this way, we get the 
following system of identities in which the degrees in 2 of R,, R,, --- 
continually decrease, so that after a certain number of steps we reach 
an f, say &,,,, which is independent of 2: 


PYF (@ Y)= Ole Y)b(% y) + Ry, y); 
Py)$(@ Y)= O(% YR (x, y)+ Rx, y), 
Py) Ria, y) = Of, y) Ria, y) + R(x, y), 


Pi_(y)Ry_f% ¥) = Qp_1(a, y)R,_(x,y)+ Ria, y), 
Py) Ry-(% Y) = Oe y)B(a, y) + Boss(y)- 


THEOREM 1. A necessary and sufficient condition that f and ¢ have 
a common factor which involves x is 


Ry1(y)= 9. 


er 
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In order to prove this theorem we first note that, by the first of the 
identities (1), any common factor of f and ¢ is a factor of R,, hence, 
by the second of the identities, it is a factor of R,, etc. Finally we 
see that every common factor of f and ¢$ is a factor of all the R’s. But 
R,., does not contain z. Hence if fand ¢ have a common factor 
which contains 2, R,.,(y)=0. 

Now suppose conversely that R,.,(y)=9, and let 


(2) E(x, y= S(Y)G(a, y)s 


where G has no factor in y alone.* The last identity (1) then tells 
us that P,(y) is a factor of 


O(a, Y)S(Y) F(a y), 


and since by hypothesis G has no factor in y alone, P,(y) must, by 
the Corollary of Theorem 2, § 73, be a factor of Q,S, that is 


(3) Q(x, y)S(y) = P,(y)H(a, y)- 


Substituting first (2) and then (8) in the last identity (1), and cancel- 
ling out the factor P,(y) from the resulting identity, as we have a 
right to do since P,(y) #9, we get the result 


Ry_(x, y)= H(z, y)G(a, y). 


That is, G is a factor not only of &, but also of R,_,. Accordingly 
we may write the next to the last identity (1) in the form 


Py) Ro, Y) =I (x y)G(a, 9): 


By the corollary of Theorem 2, § 78, we see that P,_,(y) is a factor 
of J, so that P,_,(y) can be cancelled out of this last written identity, 
and we see that G@ is a factor of F,_,. 

Proceeding in this way, we see that G is a factor of all the R’s, 
and therefore, finally, of fand ¢. Moreover, we see from (2) that 
G is of at least the first degree in x, as otherwise A, would not con- 
tain z, while #,,, was assumed to be the first of the #’s which did 
not involve z. 

Thus our theorem is proved. 

Since, as we saw above, every common factor of f and ¢ is also a 
factor of all the #’s, it follows from (2) that, if » isa common factor 


of f and ¢, G(x, y)S(y) = (2, y) K(a, y)- 


* If Rp has no factor in y alone, S reduces to a constant, 
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If then y contains no factor in y alone, S' must, by the Corollary of 
Theorem 2, § 73, be a factor of AK. Consequently by cancelling out 
S from the last written identity, we see that w is a factor of G. 
That is, 


Tueorem 2. Jf in Huclid’s algorithm R,,, =9, the greatest com- 
mon divisor of f and $ which contains no factor in y alone is the poly- 
nomial G(a,y) obtained by striking from R{x,y) all factors in y 
alone. 


We note that if R,,, is a constant different from zero; f and $ 
are relatively prime; but that the converse of this is not true as the 
simple example 

f =227 + 3y’, p=ux 
shows. 

Going back to the identities (1), we get from the first of these 
identities, by mere transposition, the value of A, in terms of f and 
¢@ (and Py, Q). Substituting this value in the second identity, we 
get a value for A, in terms of f, d, and certain P’s and Q’s. Pro- 
ceeding in this way, we finally get the formula 


Gy Roly) = Fa, yy F(a y) + P(e yo Y) 


where Ff and © are polynomials in (z, y). 


75. Factors of Polynomials in Two Variables. 


TuHeoreM 1. Jf f(a, y) and $(2, y) are any two polynomials in x 
and y, and W(x, y) ts an irreducible polynomial which is a factor of 
the product fp, then rr is a factor of f or of >. 


If y does not contain both 2 and y, this theorem reduces to 
Theorem 2, § 73. It remains, then, only to consider the case that p 
involves both variables. In this case, at least one of the polynomials 
Ff, @ must be of at least the first degree in 2. Without loss of gener- 
ality we may assume this to be f. If w is a factor of f, our theorem 
istrue. Suppose y is not a factor of f; then, since > is irreducible, f 
and yw are relatively prime, and if we apply the algorithm of the 
greatest common divisor to f and w (as we did in the last section to 
f and ¢) the first remainder #,,,(y) which does not involve z is not 
identically zero. The identity (4) of the last section now becomes 


(1) Roy) = FO fe y) + V@, Wv@ y)- 
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If we multiply this by $(2, y), the second member becomes a poly- 
nomial which has y as a factor, since, by hypothesis, f$ has pas a 
factor. We may therefore write 


(2) Bos (Y)P@ Y= Wa Y)X(% Y)- 


Now no factor other than a constant of R,,, can be a factor of p, 
since y is irreducible. Consequently, by the Corollary of Theorem 
2, § 73, R,,, is a factor of y(2, y). Cancelling out R,,, from (2), 
as we have a right to do since it does not vanish identically, we get 
an identity of the form 


PC Y= V(% Y) XA(% Y)3 


that is, y is a factor of ¢, and our theorem is proved. 
By applying this theorem a number of times, we get the 


CoRoLLARY. Jf the product of any number of polynomials in two 
eet File Was ) Aker Y) 
as divisible by an irreducible polynomial (a, y), then wv is a factor of 
at least one of the f’s. 

We come now to the fundamental theorem of the whole subject 
of divisibility of polynomials in two variables. 


THEOREM 2. A polynomial in two variables which is not identically 
zero can be resolved into the product of irreducible factors no one of 
which is a constant in one, and essentially in only one, way. 


That a polynomial f(x, y) can be resolved into the product of 
irreducible factors no one of which is a constant in at least one way 
may be seen as follows. If fis irreducible, no factoring is possible 
or necessary. If fis reducible, we have 


F(t, YN=F(% NILX Y)s 


where neither f, nor f, is a constant. If f; and f, are both irredu- 
cible, we have a resolution of f of the form demanded. If not, resolve 
such of these polynomials f, and f, as are reducible into the product 
of two factors neither of which is a constant. We thus get f ex- 
pressed as the product of three or four factors. This is the resolu- 
tion of f demanded if all the factors are irreducible. If not, resolve 
such as are reducible into the product of two factors, etc. This pro- 
cess must stop after a finite number of steps, for each time we factor 
ze 
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a polynomial into two factors, the degrees of the factors are lower 
than the degree of the original polynomial. We shall thus ulti- 
mately resolve f by this process into the product of irreducible 
factors, no one of which is a constant. 

Suppose now that f can be resolved in two ways into the product 
of irreducible factors none of which are constants, 


I(@ YHA Wo(% Y) + Fl ® Y) 
= b1(% Y)$o(% Y) ++ bil Y)- 


Since ¢, is a factor of f, it must, by the Corollary of Theorem 1, 
be a factor of one of the polynomials f,, f,, --- f, Suppose the f’s so 
arranged that it is a factor of f,. Then, since /, is irreducible, f, and 
¢, can differ only by a constant factor, and since ¢,#0, we may cancel 
it from the identity above, getting 


So S3 °° Se= baby br 


In the same way we see from this identity that f, and one of the 

$s, say d,, differ only by a constant factor. Cancelling ¢,, we get 
Cots + Sk=b3 °° be 

Proceeding in this way, we should use up the ¢’s before the /’s if 
L<k, the f’s before the ¢’s if 7 >. Neither of these cases is possible, 
for we should then have ultimately a constant on one side of the 
identity, and a polynomial different from a constant on the other. 
Thus we must have £=7. Moreover we see that the f’s can be 
arranged in such an order that each f is proportional to the corre- 
sponding ¢, and this is what we mean (cf. Definition 7, § 60) by 
saying that the two methods of factoring are not essentially different. 

Thus our theorem is proved. 


THEOREM 38. If two polynomials f and $ in (x, y) are relatively 
prime, there are only a finite number of pairs of values of (a, y) for 
which f and both vanish.* 


For if f and ¢ both vanished at the points 
(3) (@y Y1)) (Wa Yo)s 5 


and if these points were infinite in number, there would be among 
them either an infinite number of distinct 2’s or an infinite number of 
* Stated geometrically, this theorem tells us that two algebraic plane curves 


f(%, y)=9, (a, y)=0 van intersect in an infinite number of points only when they 
have an entire algebraic curve in common, 


— 


_— 
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distinct y’s. By a suitable choice of notation we may suppose that 
there are an infinite number of distinct y’s. Then it is clear that f 
and @ must be of at least the first degree in 2, since a polynomial in y 
alone which does not vanish identically cannot vanish for an infinite 
number of values of y. Wemay then apply tof and ¢ the algorithm 
of the greatest common divisor as in § 74, thus getting (cf. (4), 
§ 74) an identity of the form 


(4) F(x, YX, y+ O@ YG Y) = Roly) #9. 

Since the first member of (4) vanishes at all the points (8), R,.,(y) 
would vanish for an infinite number of distinct values of y, and this 
is impossible. 

An important corollary of the theorem just proved is that if f and 
¢@ are two irreducible polynomials in (x, y), and if the equations 
f=0 and ¢=0 have the same locus, then f and ¢ differ merely by 
a constant factor. This would, however, no longer be necessarily 
true if f and ¢ were not irreducible, as the example, 

f=zry, ¢=2'y, 
shows; for the two curves f=0 and ¢ = 0 are here identical, since 
the curve in each case consists of the two codrdinate axes, and yet f 
and ¢ are not proportional. By means of the following convention, 
however, the statement made above becomes true in all cases: 
Let f be resolved into its irreducible factors, 


SES SY Ses 
where f, --- f, are irreducible polynomials in (a, y), no two of which 
are proportional to each other. The curve f= 0 then consists of the 


k pieces, of 0, fg = 0, fy = 0. 


To each of these pieces we attach the corresponding positive integer 
«, which we call the multiplicity of this piece; and we then regard 
two curves given by algebraic equations as identical only when they 
consist of the same irreducible pieces, and each of these pieces has 
the same multiplicity in both cases. With this convention we may 
say: 

CorotuAry. Jf f and ¢ are polynomials in (a, y) neither of which 
is identically zero, a necessary and sufficient condition that the two curves 
f=0, 6=0 be identical is that the polynomials f and differ only by 
a constant factor. 
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EXERCISES 


1. Let f(z), (x), W(x) be polynomials in x whose coefficients lie in a certain 
domain of rationality. Then if w is irreducible in this domain and is a factor of 
the product f¢, prove that wy is a factor of f or of ¢. 


2. Let f(z) be a polynomial in z, which is not identically zero, and whose 
coefficients lie in a certain domain of rationality. Prove that fcan be resolved 
into a product of polynomials whose coefficients lie in this domain, which are 
irreducible in this domain, and no one of which is a constant, in one and essen- 
tially in only one way. 


3. Extend the results of this section to polynomials in two variables whose 
coefficients lie in a certain domain of rationality. 


76. Factors of Polynomials in Three or More Variables. The re- 
sults so far obtained in this chapter may be extended to polynomials 
in three variables without, in the main, essentially modifying the 
methods already used. We proceed therefore to state the theorems 
in the order in which they should be proved, leaving the proofs of 
most of them to the reader. The extension to the case of n variables 
then presents no difficulty, and is left entirely to the reader (cf. 
Exercise 1). 

Let f(x, y,2) be any polynomial in three variables, and suppose it 
arranged according to powers of z, 


S(, Ys @)Z Ay (Y, 2a" + AY, z)a™EF + + AY, 2), 
the a’s being polynomials in (y, 2). 
Corresponding to Theorems 1, 2 of § 73 we have 


THEOREM 1. A necessary and sufficient condition that a polynomial 
in (y, 2) be afactor of f is that it be a factor of all the a’s. 


THroreM 2. If f(a, y, 2) and (a, y, 2) are any two polyno 
mials in (x, y, 2) and Wy, 2) is an irreducible polynomial in (y, 2) only 
which is a factor of the product fp, then w is a factor of f or of ¢. 


CorotuaRy. Let f(x, y, 2) and (x, y, 2) be polynomials in 
(x, y, 2), and let Wy, 2) be a polynomial in (y, 2) alone. If wisa 
factor of the product of fp, but is relatively prime to d, thenit is a fac- 
tor of f. 

To find the greatest common divisor of two polynomials in three 
variables we proceed exactly as in the case of two variables, getting 
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a set of identities similar to (1), § 74, the P’s and R,,, being now 
functions of (y, 2), while the other R’s and the Q’s are functions of 
(x, y,z). Corresponding to Theorems 1, 2 of § 74 we now have 


THEOREM 3. A necessary and sufficient condition that f(x, y, 2), and 
P(x, y, 2) have a common factor which involves x is that R,,,(y, 2)= 0. 


THEorEM 4. Jf £,.,(y, 2)=0, the greatest common divisor of 
I(x, y, 2) and $(2, y, 2) which contains no factor in (y, 2) alone is 
the polynomial G(x, y, 2) obtained by striking out from R,(2, y, 2) 
all factors in (y, 2) alone. 

From the algorithm of the greatest common divisor for the two 
polynomials f(z, y, 2), (2, y,2) we also deduce the identity 


@) Rosy: 2) = La, ys 2) F(& y,2) + O(a, y, 2) h(a, y 2), 
similar to (4), § 74. 
Corresponding to Theorems 1, 2 of § 75 we have 


THEOREM 5. If f(x,y, 2) and $(a, y, 2) are any two polynomials 
and W(x, y, 2)1s an irreducible polynomial which is a factor of the prod- 


uct fp, then is a factor of f or of >. 


CoroLLARy. Jf the product of any number of polynomials 


F(@s Ys @) FA @s Ys 2) + F(@s Ys 2), 
is divisible by an irreducible polynomial (a, y, 2), then is a factor 
of at least one of the f’s. 

THEOREM 6. A polynomial in three variables which is not tdentt- 
cally zero can be resolved into the product of irreducible factors no one of 
which is a constant in one, and essentially in only one, way. 

When we come to Theorem 3, § 75, however, we find that it 
does not admit of immediate extension to the case of three vari- 
ables; for R,.,(y), which came into the proof of that theorem, 
becomes now #,,,(y, 2), and we can no longer say that this does not 
vanish at an infinite number of points (y,z). Not only is the proof 
thus seen to fail, but the obvious extension of the theorem itself is 
seen to be false when we recall that two surfaces intersect, in gen- 
eral, in a curve. 

This theorem may, however, be replaced by the following one: 

THEOREM 7. Jf f(x, y,2) and $(a, y, 2) are any two polynomials in 
three variables of which is irreducible, and if f vanishes at all points 
(x, y, 2) at which $ vanishes, then $ is a factor of f. 
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_In proving this theorem we may, without loss of generality, 
assume that actually contains one of the variables, say w; for 
if @ contains none of the variables 2, y, 2, the theorem is trivial 
and obviously true. 

Suppose ¢ were not a factor of f. Then, since ¢ is irreducible, 
f and ¢ are relatively prime. Hence, in the identity (1) above, 
Rois(y, 2) #9. Let us write 


(2) P(e y2)=bo(yr2)a" + bi(y, 2)" + + Only, 2)  (mZ1), 


where, without loss of generality, we may assume 6,(y,z)#0. Then 


(3) Bois (ys 2) 0(y, 2) #9. 
Accordingly we can find a point (y,, 2,) such that 
(4) | Rr Yy %)#% bo Yy%) #0. 


Consequently (x, Yy 2) 18 a polynomial in w alone which is of at 
least the first degree, and which therefore (Theorem 1, § 6) vanishes 
for some value z, of x That is 


b(%y Yy 2%) = 9. 
Accordingly, by hypothesis, 

SF (@yp Yp %) = 0. 
Referring now to the identity (1), we see that 

BoiiY %) = 9. 


This, however, is in contradiction with (4). Thus our theorem is 
proved. 

If to each part of a reducible algebraic surface we attach a mult?- 
plicity in precisely the same way as was explained in the last section 
for plane curves, we infer at once the 

CoroLtuARy. If f and ¢ are polynomials in (x, y, 2) neither of which 
is tdentically zero, a necessary and sufficient condition that the two sur- 
Faces f=0,6=0 
be identical is that the polynomials f and ¢$ differ only by a constant 
factor. 

Theorem 7 admits also the following generalization : 

THEOREM 8. Jf f(x, y.2) and $(a, y, 2) are any two polynomials in 
three variables which both vanish at the point (a. Yo, 2y) and of which & 
is irreducible, and tf in the neighborhood N of (xq. Yo, 2) F vanishes at 
all points at which } vanishes, then ¢ is a factor of f. 
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We assume, as before, that @ contains x and can therefore be 
written in the form (2). Let us first consider the case in which 
bo(Yo. 2%) #9. Here “the proof is very similar to the proof of 
Theorem 7. 

We obtain relation (3) precisely as above, and from it we infer 
that a point (y,, 2,) in as small a neighborhood M of (Yo %) as 
we please can be found at which the relations (4) are true. 

Now consider the equation 


(5) $0, Yn %1)=0. 


By writing ¢ in the form (2), we see that by taking the neighbor- 
hood M of (yp, 2)) sufficiently small, we can make the coefficients 
of (5) differ from the coefficients of = 


(6) P(X Yor %)=9 


by as little as we please (cf. Theorem 3, § 5). Now 2, is by hy- 
pothesis a root of equation (6). Consequently by taking WM suffi- 
ciently small, we can cause (5) to have at least one root 2, which 
differs from 2, by as little as we please (cf. Theorem 4,§ 6). Thus 
we see that a point (2, 4,,2,) in the given neighborhood WV of 
(2% Yor 2) can be found at which 


PZ, Yyp> %)= 9. 


Accordingly, by hypothesis, 
SF (2p Yy %)= 9. 


From the identity (1) we have then 
By (Y %)= 9, 


which is in contradiction with (4). Thus our theorem is proved on 
the supposition that b,(yo, 2) #0.* 


* The proof just given will, in fact, apply to the case in which not all the b’s in (2) 
vanish at the point (yo, Zo), if we use the extension of Theorem 4, § 6, which is there 
mentioned in a footnote. It is only the extreme case in which all the 6’s vanish at this 
point which requires the special treatment which we now proceed to give. The reader 
is advised to consider the geometrical meaning of this extreme case. 
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In order to treat the case in which 6,(y),2))= 9, let us denote by 
k the degree of ¢$(x,y,2), and let us subject this polynomial to a 
non-singular linear transformation 


e=a,7' +8 y' +2 
(7) Y = Mae! + Boy! + 92! 
pS a0! + Bsy' +72! 


which makes the degree of ¢ in x equal to the total Se kof $ 
(cf. Theorem 2, $64). 

Suppose that this transformation carries over the point (X,Y ;2) 
into the point (zj, yj, 24). Then it is possible, since (7) is non- 
singular, to take such a small neighborhood N' of (2), yj %) that 
all points in this neighborhood correspond to points in the given 
neighborhood WV of (a; Yo: 2): 

Moreover, by means of (7), ¢ has gone over into 


(8) d'(2', y', z’) = jan ca ily’, 2!al a aeOe bi(y’, z'), 


where 6) is a constant different from zero. Let us denote by 
f(a’, y', 2') the polynomial into which f is transformed. Then it 
is clear that, since, in the neighborhood WV, f vanishes whenever @ 
does, in the neighborhood WN’ (which corresponds to a part of V), f' 
vanishes whenever ¢’ does. Accordingly we can apply the part of 
the theorem already proved to the two polynomials f’ and @¢’, since 
the first coefficient of ¢' in the form (8), being a constant different 
from zero, does not vanish at (yj, 2). We infer that ¢’ is a factor 


! 
of f ’ ie, y', z") =o (a, y', 2!) W'(2', y', 2’), 


If here we replace a’, y', 2’ by their values in terms of 2, y, 2 from 
(T), we see that ¢ is a factor of f; and our theorem is proved. 


EXERCISES 


1. State and prove the eight theorems of this section for the case of polyno- 
mials in n variables. 


2. Extend the result of the exercise at the the end of §73 to the case of 
polynomials in n variables. 


3. Extend the results of the two preceding exercises to the case in which we 
consider only polynomials whose coefficients lie in a certain domain of ration- 
ality. 
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4. The resultant of two polynomials in one variable 
SF (@) Sagu” + a,aP3 4 oe + Any 
© h (&) Sbyumt dx A4 oe + Ding 


is sometimes defined as the polynomial R in the a’s and b’s of lowest degree which 
satisfies an identity of the form Ff +b=R, 


where F and © are polynomials in (4p, +++ nj boy +++ bm; x), and the identity is an 
identity in all these arguments. Prove that the resultant as thus defined differs 
only by a constant factor different from zero from the resultant as we defined it 
in § 68. 


CHAPTER XVII 


GENERAL THEOREMS ON INTEGRAL RATIONAL 
INVARIANTS 


77. The Invariance of the Factors of Invariants. Let us con- 
sider the general n-ary form of the kth degree which we will rep- 
resent by f(x, +++ U3 4) A +++), the w’s being the variables and the 
a’s the coefficients. By suitably changing the a’s, this symbol may 
be used to represent any such form. Hence, if we subject such a 
form to a linear transformation, the new form, being n-ary and of 
the same degree as the old, may be represented by the same func- 
tional letter: f(a}, --- 23 @, a5,-°--). This new form will evidently 
be homogeneous and linear in the a’s; that is, each of the a’s isa 
homogeneous linear polynomial in the a’s. It is also clear that each 
of the @’s is a homogeneous polynomial of the Ath degree in the 
coefficients of the transformation. 

It follows from the very definition of invariants that if we have 
a number of integral rational relative invariants of a form or system. 
of forms, their product will also be an integral rational relative in- 
variant. It is the converse of this that we wish to prove in this 
section. We begin by stating this converse in the simple case of a 
single form. 


THEOREM 1. Jf I(a,, ay, ---) is an integral rational invariant of 


the n-ary form 
y Fe TGs HOCH Ay, Ags vee), 


and is reducible, then all its factors are invariants. 


It will evidently be sufficient to prove that the irreducible factors 
of I are invariants. Let f,, f,, ---; be the irreducible factors of J. 
Subjecting f to the linear transformation 


Ce elie ! 
Ly = CX + + Cyn Lny 


Ik 
Ly = CyyVy af se Poke Gaeta 
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whose determinant we call c, and denoting the coefficients of UG 
transformed form by a}, aj, ---, we have 


(1) WG val) = OL (ay, ae *-+); 


an identity which may also be written 
FiQ> May 2) FA My My 0°) = OF (Ay gp +) Fly Ay ***)- 


We have here a polynomial in the e¢’s and _a’s which, on the 
second side of the identity, is resolved into its irreducible factors, 
since by Theorem 1, §61, the determinant ¢ is irreducible. Hence 
each factor on the first side is equal to the product of some of the 
factors on the second. That is 


(2) Flay hy °°) = {Ay Agy ---) (and, 22-7) 
where the ¢’s are polynomials. 
Now let 

SW pe the ea Oa 1, 


and let all the other ¢’s be zero. Our transformation becomes the 
identical transformation, the determinant ¢ = 1, and each a’ is equal 
to the corresponding a. The identities (2) therefore reduce to 


Filey Ay, )= (a4, Gas ++) (2 = 1, 2,... 1). 


Substituting this value of ¢; in (2), we see that f; is an invariant, 
and our theorem is proved. 
The general theorem, now, is the following : 


THEOREM 2. If I (a4, dg, +++ 3 54, bg, +++ 3 +++) 18 an integral rational 
invariant of the system of forms 


Flee “°° Dn} Ay, Ay vee) 


Sty 9 Dad b,, De s+) 


and is reducible, then all its factors are invariants. 


The proof of this theorem is practically identical with that of 
Theorem 1. 
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EXERCISE 

Tf I(a4, do, +++ 3 D1, Bay e885 8885 Yay Yn} Zi 2nj 17) IS an integral rational co- _ 
variant of the system of forms 

Fit, ++ %y 5 M1, Az, ***), 

Fat, + Ln 3 D1, ba, +), 


and the system of points (71, +++ Yn), (21) ***2n),***, and is reducible, then all its 
factors are covariants (or invariants). 


78. A More General Method of Approach to the Subject of Rela- 
tive Invariants. We have called a polynomial J in the coefficients 
of an n-ary form f a relative invariant of this form if it has the 
property of being merely multiplied by a power of the determinant 
of the transformation when f is subjected to a linear transformation. 
It is natural to inquire what class of functions J we should obtain if 
we make the less specific demand that Z be multiplied by a poly- 
nomial in the coefficients of the transformation. We should expect 
to get in this way a class of functions more general than the invyari- 
ants we have so far considered. Asa matter of fact, we get precisely 
the same class of functions, as is shown by the following theorem: 


THEOREM. Let I be a polynomial not identically zero in the co- 
efficients (ay, ay,++-) of an n-ary form f, and let (aj, a4, ++.) be the co- 
efficients of the form obtained by subjecting f to the linear transformation 


ey ! 
Ly = Cy XM Fo F Cyn Fny 


Dn = Cn Loe + Cath. 
Lh Tay, oy +++) = WC Cys +++ Cnn) Lays Ags +++), 
where ris a polynomial in the c’s, and this is an identity in the a’s and 
c's, then is a power of the determinant of the transformation. 


We will first show that ~+0 when c+0. If possible let d,,,-+-dnn 
be a particular set of values of the ¢,,’s such that 


Wd, ae dan) es 0, 
while dj, sist din 
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Then the transformation 
Hie / 
y= Aye + + + dyntny 
= I 
Ln = njeyt coo Quint 


: ‘ie, 
has an inverse Hh = Oy hy He + Oy nny 


vay eee 
Let us consider a special set of a’s such that I(a,, a, ---)+0. Then 
T (ay, 3) +++) = WA yyy +++ Inn) Lays Ags +) = 0. 
Now apply the inverse transformation, and we have 
(ay yy =) = WSq19 + San) (ay Ay) = 9, 


which is contrary to our hypothesis. 
Having thus proved that y can vanish only when c=9, let us 
break up ¥ into its irreducible factors, 


Wl Cy °° Cnn) = Wil Cras *** Cnn) Wal eras *** Cnn) °° Wal eqs *** Cnn) 


Since y vanishes whenever W;=0, Wy; can vanish only when c= 0. 
Hence by the theorem for n variables which corresponds to Theorem 7, 
§ 76, x; must be a factor of c. But eis irreducible. Hence y; can 
differ from ¢ only by a constant factor, and we may write 


v= Ker. 


It remains then merely to prove that the constant / has the value 1. 
For this purpose consider the identity 


I (ah, aj, +++) = KAT (ay, dy +++), 


and give to the ¢,’s the values which they have in the identical 
transformation. Then c=1, and the a’’s are equal to the correspond- 
ing a’s. The last written identity therefore becomes 


I(a,, Ao» >) = KL (4a, Ags r)3 


from which we infer that K = 1. 
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EXERCISES 


1. Prove thatif a polynomial J in the coefficients a1, a, --- of an n-ary form and 
the coérdinates (1, +++ Ym) of a point has the property of being merely multiplied 
by a certain rational function y of the coefficients of the transformation when the 
form and the point are subjected to a linear transformation, then w is a positive or 
negative power of the determinant of the transformation, and J is a covariant. 


2. Generalize the theorem of this section to the case of invariants of a system 
of forms. 


3. Generalize the theorem of Exercise 1 to the case of a system of forms and 
a system of points. : 


4. Prove that every rational invariant of a form or system of forms is the 
ratio of two integral rational invariants. 


5. Generalize the theorem of Exercise 4 to the case of covariants. 


79. The Isobaric Character of Invariants and Covariants. In 
many investigations, and in particular in the study of invariants and 
covariants, it is desirable to attach a definite wezght to each of the 
variables with which we have to deal. To a product of two or more 
such variables we then attach a weight equal to the sum of the 
weights of the factors, and this weight is supposed to remain 
unchanged if the product is multiplied by a constant coefficient. 
Thus if 2,, 2,, 23 are regarded as having weights wy, w., ws respectively, 
the term ee. ae 

172% 
would have the weight W, + Wy + ws. 


If, then, having thus attached a definite weight to each of the vari- 
ables, we consider a polynomial, each term of this polynomial will be 
of a definite weight, and by the weight of a polynomial we understand 
the greatest weight of any of its terms whose coefficient is not zero. If 
moreover all the terms of a polynomial are of the same weight, the 
polynomial is said to be isobarie. 

It may be noticed that, according to this definition, a polynomial 
which vanishes identically is the only one which has no weight, while 
a polynomial which reduces to a constant different from zero is of 
weight zero. Moreover if two polynomials are of weights w, and w,, 
their product is of weight w, + w,.* 


* The conception of degree of a polynomial is merely the special case of the con- 
ception of weight in which all the variables are supposed to have weight 1. The con- 
ception of being isobaric then reduces to the conception of homogeneity. 
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We will apply this conception of weight first to the case in which 
the variables of which we have been speaking are the coefficients 
45 Ms, + Of the n-ary form 


T(&is eee Tn 15 Ao) os), 


We shall find it desirable to admit n different determinations of the 
weights of these a’s; one determination corresponding to each of the 
variables z,, +++ 2,. j 


n 


DEFINITION 1. Jf a; is the coeffictent of the term 
A; WA Ub? ++ ohn 


in an n-ary form, we assign to a; the weights py Dos: Py respectively 
with regard to the variables x1, Ly, +++ Lp» 


In the case of a binary form, 
ks k-1 a k 
a) tj + a,a4 1a, + --- + 4,24, 


the subscripts of the coefficients indicate their weights with regard 
to 2, while their weights with regard to zx, are equal to the differences 
between the degree of the form and these subscripts. 

As a second example, we mention the quadratic form 


n 
2a, L;,L;. 


Here the weight of any coefficient with regard to one of the vari- 
ables, say x;, is equal to the number of times y occurs as a subscript 
to this coefficient.* 

In connection with: this subject of weight, the special linear 
transformation Pn, (ins) 
(1) [isaees é 


0, =z, 

is useful. If a; is a coefficient which is of weight X with regard to 
aj the term in which this coefficient occurs contains a, and therefore 
a, = kk Aj 


* For forms of higher degree, a similar notation for the coefficients by means of 
multiple subscripts might be used. The weight of each coefficient could then be at 
once read off from the subscripts. 
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That is 


THEOREM 1. The weight with regard to x, of a coefficient of an 
n-ary form is the exponent of the power of k by which this coefficient is 
multiplied after the special transformation (1). 


From this it follows at once that an isobaric polynomial of 
weight A with regard to a; in the coefficients (a, a,,---) of an n-ary 
form is simply multiplied by %’* if the form is subjected to the linear 
transformation (1). 

Moreover, the converse of this is also true. For if a}, a},--- are 
the coefficients of the n-ary form after the transformation (1), and if 
(44, A --:)is a polynomial which has the property that 


p( a, Ay, ve) = kK h(a, ae, vee), 


this being an identity in the a’s and also in k, we can infer, as fol- 
lows, that ¢ is isobaric of weight >. Let us group the terms of ¢ 
together according to their weights, thus writing ¢ in the form 


(a4, a )= $1 (a, As s+) + o( a4, Ans +++) +... 
where qj, ¢,, --- are isobaric of weights ),, A, ---.. We have then 
PCAs yy +++) = HM hy (Uys gy +++) + hy (Ays gy 02) Hooves 
But on the other hand 7 
P( a4 Uy, ++*) = AP Ay, Ags +++) = Hh (yy gy +++) + hg (ay, Ay ns tee 


Comparing the last members of these two identities, we see that 
N=AY=AQ= 


as was to be proved. We have thus established the theorem: 


THEOREM 2. A necessary and sufficient condition that a poly- 
nomial in the coefficients of an n-ary form be simply multiplied by kr 
when the form is subjected to transformations of the form (1) is that 
be isobaric of weight r with regard to x,. 


By means of this theorem we can show that the use of the word 
weight introduced in §381 is in accord with the definition given in 
the present section. For an integral rational invariant of ‘an n-ary 
form which, according to the definition of §31, is of weight » will, if 
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the form is subjected to the transformation (1), be merely multiplied 
by # and must therefore, according to Theorem 2, be isobaric of 
weight A with regard to 2, That is: 


THEOREM 38. Jf I is an integral rational invariant of a form f 
which according to the definition of $31 is of weight Xr, tt will also be of 
weight % with regard to each of the variables x; of f according to the 
definitions of this section, and it will be isobaric with regard to each of 
these variables. 

As an illustration of this theorem we may mention the discrimi- 
nant Ay ty — a2 
of the binary quadratic form 

AX} + 24, 2,%_ + Ay 22 


which is isobaric of weight 2 both with regard to x, and with regard 
tO Xp. 

The reader should consider in the same way the discriminant of 
the general quadratic form. 

All of the considerations of the present section may be extended 
immediately to the case in which we have to deal, not with a single 
form, but with a system of forms. We state here merely the 
theorem which corresponds to Theorem 38. 


THEOREM 4. Jf Lis an integral rational invariant of a‘system of 
forms which according to the definition of $31 ts of weight X, it will also 
be of weight X with regard to each of the variables x, of the system, and 
at will be isobaric with regard to each of these variables. 

The reader may consider as an illustration of this theorem the 
resultant of a system of linear forms, and also the invariants obtained 
in Chapters XII and XIII. 

We saw in Theorem 5, § 31, that the weight of an integral rational 
invariant cannot be negative. This fact now becomes still more 
evident, since the weight of no coefficient is negative. Moreover, 
we can now add the following further fact: 


THEOREM 5. An integral rational invariant of a form or system 
of forms cannot be of weight zero. 

For consider any term of the invariant whose coefficient is not 
zero. This term involves the product of a number of coefficients of 
the system of forms. Since none of these coefficients can be of nega- 

Q 
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tive weight, the weight of the term will be at least as great as the 
weight of any one of them. But any one of them is at least of 
weight 1 with regard to some one of the variables. Hence the in- 
variant is at least of weight 1 with regard to some one of the 
variables, and hence with regard to any of the variables. 

In order, finally, to be able to extend the considerations of this 
section to the case of covariants, we must lay down the following 
additional definition: 


DEFINITION 2. Jf the sets of variables (yy, +++ Yn)s (15 ++ Sn)s 07° 
are cogredient with the variables (a, --- ,) of a system of n-ary forms, 
we will assign to Yj, %, ++ the weight —1 with regard to x,, to all the 
other y's, 2s, etc. the weight 0. 

It will be noticed that here too, when we perform the transform- 
ation (1), each of the variables is multiplied by a power of & whose 
exponent is the weight of the variable. It is therefore easy* to 
extend the considerations of this section to this case, and we thus 
get the theorem: 


THEOREM 6. Jf I is an integral rational covariant of a system 
of forms and a system of points which is of weight according to the 
definition of § 31, tt will also be of weight X with regard to each of the 
variables of the system, and it will be isobaric with regard to each of 
these variables. 


As an example of this theorem we note that the polar 
My Y 2 + Oy (Yy2q + Yo2%j) + A Yo% 


of a binary quadratic form is isobaric of weight zero. The reader 
may satisfy himself that the same is true of the polar of the general 
quadratic form. 


80, Geometric Properties and the Principle of Homogeneity. It 
is a familiar fact that many geometric properties of plane curves or 
surfaces are expressed by the vanishing of an integral rational func- 
tion of the coefficients of their equations. Take, for instance, the 
surface 


(1) Si CBs Y, 23 Ay, Ag, --)= 0, 


* Slight additional care must be taken here on account of the possible presence of 
terms of negative weight. 
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where f is a polynomial of the Ath degree in the non-homogeneous 
coordinates 2, y, 2, and a, d,,--- are the coefficients of this polyno- 
mial; and consider the relation 


(2) Play Ay ++) = 0, 


where ¢ is a polynomial, which we will assume to be of at least 
the first degree, in the coefficients a,, a,,---. By Theorem 3, § 6, 
there are an infinite number of polynomials of the &th degree in 
(x, y, 2) whose coefficients satisfy the relation (2) and also an infinite 
number whose coefficients do not satisfy this relation. In other 
words, all polynomials of the Ath degree in (a, y, 2) may be divided 
into two classes, A and B, of which the first is characterized by con- 
dition (2) being fulfilled, while the second is characterized by this 
condition not being fulfilled. We may therefore say that (2) is a 
necessary and sufficient condition that f have a certain property, 
namely, the property of belonging in class A. 

The simplest examples, however, show that this property of f 
need not correspond to a geometric property of the surface (1). 
To illustrate this, let k=1, so that we have 


fz at + AY + Ag2@+ dy 


and consider first the polynomial in the a’s: 
o= a, 


The vanishing of ¢ gives a necessary and sufficient condition that f 
belong to the class of homogeneous polynomials of the first degree 
in (z, y, 2), and thus expresses a property of the polynomial. This 
same condition, a,=0, also expresses a property of the plane f= 0, 
namely, the property that it pass through the origin. 
Suppose, however, that instead of the function ¢ we take the 
polynomial $,=4,—1. 
The vanishing of this polynomial also gives a necessary and suffi- 
cient condition that the polynomial f have a certain property, namely, 
that its constant term have the value 1. It does not serve to dis- 
tinguish planes into two classes, since we may write the equation of 
any plane (except those through the origin) either with the constant 
term 1 or with the constant term different from 1 by merely multi- 
plying the equation through by a constant. 
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From the foregoing it will be seen that saying that a surface has 
a certain property amounts to the same thing as saying that it 
belongs to a certain class of surfaces.* 


THrorEM 1. The equation (2) expresses a necessary and sufficient 
condition for a geometric property of the surface (1) when, and only 
when, the polynomial is homogeneous. 


For if ¢ is non-homogeneous, let us write it in the form 
b=, + brat — + $,+ go 


where ¢, is a homogeneous polynomial of the nth degree and each 
of the other ¢’s which is not identically zero is a homogeneous poly- 
nomial of the degree indicated by its subscript. Let aj, aj, --: bea 
set of values of the a’s for which ¢, and at least one of the other ¢,’s 
is not zero, and consider the surface 


(3) F (a, Y, 25 ca, CR, -)=0. 
The condition (2) for this surface is 
eb, (a4, ayy oes ee o1¢,_(a, Abs ee cee fb ch, (at, a}, +++) 
+ ho(a}, ah, +++) = 0. 


This is an equation of the nth degree in.c, and since at least one 
of the coefficients after the first is different from zero, it will have 
at least one root c,#0. On the other hand, we can find a value 
¢,#0 which is not a root of this equation. Hence the surface (8) 
satisfies condition (2) if we let ¢ =c, and does not satisfy it if e=«¢. 
But a change in the value of ¢ merely multiplies the equation (3) 
by a constant and does not change the surface represented by it. 
Thus we see that one and the same surface can be regarded both as 
satisfying and as not satisfying condition (2). In other words, if ¢ 
is non-homogeneous, (2) does not express a property of the surface (1). 

Assume now that ¢@ is homogeneous of the mth degree, and 
consider the class A of polynomials f whose coefficients satisfy equa- 
tion (2) and the class B whose coefficients do not satisfy this equa- 
tion. Our theorem will be proved if we can show that we have 
hereby divided the surfaces (1) into two classes, that is, that if 


* This brief explanation must not be regarded as an attempt to define the concep- 
tion property, for no specific class can be defined without the use of some property. 
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Q{, @,--- are the coefficients of a polynomial of class A and a’, al, -.. 
the coefficients of a polynomial of class B, then the two surfaces 

+ fle yes al, aly) =0, 

SB Ys % Ay ag, +) = 0, 
cannot be the same. If they were the same, the coefficients aj, a}, --- 
would be proportional to a{, a}, --- (cf. Theorem 7, Corollary, § 76), 
Gy 00, A, =a), 
and therefore (i, By, ---) = eal, ab, +-*). 
But this is impossible since by hypothesis 
P(A, My) = 0, P(A, ay, +) #0. 

Thus our theorem is proved. 

This theorem admits of generalization in various directions.. 
Suppose first that instead of a single surface (1) we have a system 
of algebraic surfaces, and that ¢ is a polynomial in the coefficients of 
all these surfaces. Then precisely the reasoning just used shows that 
the equation ¢=.0 gives a necessary and sufficient condition for a 
geometric property of this system of surfaces when and only when ¢ 
is homogeneous in the coefficients of each surface taken separately. 

On the other hand, we may use homogeneous coérdinates in 
writing the equations of the surfaces, and the results so far stated 
will obviously hold without change: 


THEOREM 2. Let 
F(a, Y zy t; 15 Ao, r+), aes Y; 2, t; b,, by, vy eg 
be a system of homogeneous polynomials in (x, y, 2, t) whose coefficients 
ATE Ay, Ag, -++3 by, bg, --+3 ete. and let 
an ep ee ee 
be a polynomial in the a’s, b’s, etc. Then the equation = 0 expresses 
a necessary and sufficient condition that the system of surfaces 


i= 0, j= QO, ++ 
have a geometric property when, and only when, the polynomial > is 
homogeneous in the a’s alone, also in the b’s alone, ete. 
In conclusion we note that all the results of this section can be 


extended at once to algebraic curves in the plane ; or, indeed, to the 
case of space of any number of dimensions. 
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EXERCISE 


If, in Theorem 2, besides the surfaces f; =0, f2 = 0, -+- we also have a system 


Se yn (Gr Yy *1y ti), (x2, Y2, 225 t2), ac 


and if # is a polynomial not merely of the a’s, b’s, etc., but also of the coérdinates 
of these points, prove that @=0 expresses a necessary and sufficient condition 
that this system of surfaces and points have a geometric property when and 
only when ¢ is homogeneous in the a’s alone, in the 0’s alone, etc., and also in 
(21, Y1, 21, 41) alone, in (22, Y2, 22, t2) alone, etc. 


81. Homogeneous Invariants. From the developments of the last 
section it is clear that the only integral rational invariants which 
will be of importance in geometrical applications are those which are 
homogeneous in the coefficients of each of the ground-forms taken 
separately.* Such invariants we will speak of as homogeneous in- 
variants. It will be found that all the invariants which we have 
met so far are of this kind. 

An important relation between the weight and the various de- 
grees connected with a homogeneous invariant is given by the follow- 
ing theorem : 


THEOREM 1. Jf we have a system of n-ary forms, 
Fe re hay lee -)y 


(1) Fa(@p 0+ ns Oy Bas ++) 


of degrees m4, Mo, «+: respectively, and if 
FEC ee tem, SN 


* This statement must not be taken too literally. It is true if in the geometrical 
application in question we consider the variables as homogeneous codrdinates and if 
we have to deal with the loci obtained by equating the ground-forms to zero, While 
this is the ordinary way in which we interpret invariants geometrically, other inter- 
pretations are possible. For instance, instead of interpreting the variables (a, y) as 
homogeneous codrdinates on a line and equating the binary quadratic forms 

Si Zaye? + 2 agay + asy, 
So= bia? + 2 boxy + dgy?, 
to zero, thus getting two pairs of points on a line, we may interpret (a, y) as non- 


homogeneous coordinates in the plane, and consider the two conics fj; =1, f2=1. With 
this interpretation, the vanishing of the invariant 


aids — as + bibs — 63, 


which is not homogeneous in the a’s alone or in the b’s alone, has a geometric meaning. 
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is a homogeneous invariant of this system, of weight r, and of degree a 
in the a’s, B in the b’s, ete., then 


(2) “  metm B+ =m. 


Subjecting the forms (1) to the linear transformation 
| Uy = CyB bo +O 
(3) 


= / I 
En = CnyXy + e+ + Cnn 2ny 


whose determinant we will denote by ¢, we get 


IAs Sa Cyne a Abs s+), 
eke. avaie Mee bi Us +), 


and, since by hypothesis J is an invariant of weight 2, 
(4) I (a, as ree bt, bi, 5a5% -)= CF (a,, gy °° 6, by, see vee), 


Every a’ is a homogeneous polynomial in the ¢,’s of degree m,, every 
b' of degree m,, etc. ; and since J is itself homogeneous of degree « 
in the a’s, 8 in the 0’s, etc., we see that the left-hand side of (4) is a 
homogeneous polynomial of degree mz+m,@8+ ---in the ¢’s. 
Equating this to the degree of the right-hand side of (4) in the ¢,’s, 
which is evidently md, our theorem is proved. 

An additional reason for the importance of these homogeneous 
invariants is that the non-homogeneous integral rational invariants 
can be built up from them, as is stated in the following theorem: 


THEOREM 2. Jf an integral rational invariant I of the system (1) 


be written in the form Pe ie Tetra til, 


where each of the I;s is a polynomial in the a’s, b's, ete., which is 
homogeneous in the a’s alone, and also in the b's alone, etc., and such 
that the sum of no two Is has this property, then each of the functions 


Pel aed, 


is a homogeneous invariant of the system (1). 
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This theorem follows immediately from the definition of an 
invariant. For from the identity, 


La}, ay erate O45 On eS ve) siete +a, as sisie) oF bbs see 5 +++) 
=o[L(a,, A) s\eiy b,, On cues vee) 
+- eee + 1a, Ags eieie) ® b,, by, recurs -)], 


we infer at once the identities, 


Lai, dy sees b!, bbs see )=ecAL(a, gy 29° 5 b,, bo, cee r+), 


! ! ° / / ‘ a . . 
Ta, gn = b, Oi sees )=eL(ay, Bos **2 5 b,, Ue eee 5 vee). 


In the case of a single n-ary form, but in that case only, we have 
the theorem: 


THEOREM 38. An integral rational invariant of a single n-ary form 
is always homogeneous. 
Let Tact? Des Gas Ay, +) 


be the ground-form, and let J be the invariant. By Theorem 2 we 
may write Te eas 
where J,, --- J, are homogeneous invariants. Let the degrees of these 
homogeneous invariants in the a’s be «,,-: a, respectively. Their 
weights are all the same as the weight of J, which we will call 2. 
If, then, we call the degree of f, m, we have, by Theorem 1, 


Moy = NA, Moy = NA, +++ Mee, = NA, 


from which, since m > 0, we infer 
a, = hy = coe == 


Mae ia el, <3 Jy 


are of the same degree, and J is homogeneous. 


THEOREM 4. If we have a system of n-ary forms fi, fy, ++ and 
a polynomial p in their coefficients, the equation d = 0 gives a necessary 
and sufficient condition for a projective property of the system of loct in 
space of n—1 dimensions, 


F,=9, fp= 0,00 ; 


when, and only when, p is a homogeneous invariant of the system of 


forms f. 
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If $ is a homogeneous invariant, its vanishing gives a necessary 
and sufficient condition for a geometric property (cf. § 80), and this 
property must be a projective property since when we subject the 
loci to a non-singular collineation, @ is merely multiplied by a non- 
vanishing constant. 

On the other hand let ¢=0 be a necessary and sufficient condi- 
tion for a projective property. In order to prove that ¢ is an 
invariant (it must be homogeneous by § 80) let a,, a,,+++ be the 
coefficients of f,; 8, b,,+++ the coefficients of Jo etc.; and suppose 
that the linear transformation, 


By = Oy yy + vs + Cy Wry 
Cn = Cn cua + Coals 
carries over f, into ff with coefficients a}, a) ---; jf, into f, with coeffi- 
cients bj, bf,---; ete. The polynomial ¢ formed for the transformed 
forms is als dies Oi, Lins; cs2), 
and may, since the a’’s, 6/’s, --- are polynomials in the a’s, 6’s, --- and 
the e’s, be itself regarded as a polynomial in the a’s, 0’s, --- and the 
e's. Looking at it from this point of view, let us resolve it into its 
irreducible factors, 
me a ail aiss esi d,,.d,, 1-23), be eo ee ce om) 
soma’ (ay, Gg, ==> 5 b,, Dos mr8 Ste Ch eos oe: 
It is clear that at least one of the factors on the right must con- 
tain the c's. Let ¢, be such a factor, and let us arrange it as a poly- 
nomial in the ¢’s whose coefficients are polynomials in the a’s, 0’s, 


etc. Let (Gy, Ay 2 3 By, Boy oe 5) 


be one of these coefficients which is not identically zero and which is 
the coefficient of a term in which at least one of the e’s has an expo- 
nent greater than zero. We can, now, give to the a's, b’s, --- values 
which we will denote by A’s, B’s, --- such that neither @ nor 
vanish ; and consider a neighborhood W of the point 


(A, a eae B,, By welts oo) 
throughout which 
(7) P( 44, gy +05 Byy Bgy «085 +) #0, 
(8) Way, Ag, °°" 5 b,, Dy, IES) + )# 0. 
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Let us now restrict the a’s, 6’s, «+» to the neighborhood Wand ask 
ourselves under what circumstances we can have ¢,=0. If this 
equation is fulfilled, we see from (6) that ¢ vanishes for the trans- 
formed loci, while, by (7), it does not vanish for the original loci. 
Since, by hypothesis, the vanishing of ¢ gives a necessary and 
sufficient condition for a projective property, a transformation (5) 
which causes ¢ to vanish when it did not vanish before must be a 
singular transformation. That is, ¢f the a’s, b’s, --. are in the neigh- 
borhood N, whenever $, vanishes the determinant c of (5) vanishes. 
Moreover, ¢, does vanish for values of the a’s, 6’s, --- in WV, for if we 
assign to the a’s, 6’s, --- any such values, ¢, becomes a polynomial in 
the ¢;,s, which, by (8), is of at least the first degree, and therefore 
vanishes for suitably chosen values of the ¢,’s. We can therefore 
apply the theorem for more than three variables analogous to Theorem 
8, § 76, and infer that ¢, is a factor of the determinant ¢ ; and conse- 
quently, since this determinant is irreducible oe 1, § 61), that 
¢, is merely a constant multiple of e. 

The reasoning we have just applied to ¢, applies equally to any 
of the factors on the right of (6) which are of at least the first degree 
in the ¢,’s. Accordingly (6) reduces to the form 


(9) ; p(a}, Ops goo Os bb se JS oy(a,, Ag °° 3 6 or eg “e+, 


where x no longer involves the ¢,’s. To determine this polynomial 
x, let us assign to the e¢,’s the values 0, 1 which reduce (5) to the 
identical transformation. Then the q’’s, 6’’s,--- reduce to the a’s, 
b’s ---, while c=1; so that from (9) we see that 


P( a4, Any °° 5 b,, by wee --) = y(a,, Gans: b,, bas eee +). 


Substituting this value of y in (9), we see that ¢ is really an in- 
variant. 

In order to avoid all misunderstanding, we state here explicitly 
that if we have two or more polynomials, $,, ¢,, --- in the coefficients 
of the forms fj, the equations ¢,=¢,= --- =0 may be a necessary and 
sufficient condition for a projective property of the loci f;=0, even 


though ¢,, ¢,, -+: are not invariants. For instance, a necessary and 
sufficient condition that the two lines 


AX + Ay%, + gx, = 0, 


b,x, + b,% + b.7,== 0 
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coincide is the vanishing of the three two-rowed determinants of 


the matrix See as 
| Dy 05.205 


b) 








none of which is an invariant. Or, again, a necessary and sufficient 
condition that a quadric surface break up into two planes, distinct 
or coincident, is the vanishing of all the three-rowed determinants 
of its matrix, and these are not invariants. In this case we can also 
express the condition in question by the iddentical vanishing of a 
certain contravariant, namely, the adjoint of the quadratic form; 
and this—a projective relation expressed by the identical vanish- 
ing of a covariant or contravariant —is typical of what we shall 
usually have when a single equation ¢=0 is not sufficient to express 
the condition. There are, however, cases where the condition is given 
by the vanishing of two or more invariants; cf. Exercise 6, § 90. 


EXERCISES 


1. Prove that if in Theorem 1 our system consists not merely of the ground- 
forms (1) but also of certain points 


(W1) tee Yn)> (a1, eee Bodie trey 
and we have not an invariant J but a covariant of weight A, and of degreea in the 
a’s, B in the 0’s, etc., 7 in the y’s, € in the 2’s, etc., then 
mya. + m23 + --=nd+ y+ 04-0. 
2. Extend Theorem 2 to the case of covariants. Does Theorem 3 admit of 
such extension ? 
3. Extend Theorem 4 to the case of covariants. 


4. Show that an integral rational invariant of a single binary form of odd 
degree must be of even degree. 


5. Show that the weight of an integral rational invariant of a single binary 
form can never be smaller than the degree of the form. 


6. Express the condition that (a) two lines, and (b) two planes coincide, in 
the form of the identical vanishing of a covariant or contravariant. 


7. Prove that a polynomial in the coefficients of a system of n-ary forms which 
is homogeneous in the coefficients of each form taken by themselves, and which is 
unchanged when the forms are subjected to any linear transformation of determi- 
nant + 1, is an invariant of the system of forms. 


8, Generalize Exercise 7 to the case of covariants, 
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82. Resultants and Discriminants of Binary Forms. If we inter- 
pret (x,,%,) as homogeneous codrdinates in one dimension, the equa- 
tions obtained by equating the two binary forms 


F (By) Lq) = My UG + AUT 1 ty + ++ + Oy By 
= J m—1 m 
P(@y) Hg) = bya + byap tay + + bm ae 


to zero represent sets of points ona line. The points given by the 
equation f= 0 are the points at which the linear factors of f vanish, 
and the points corresponding to ¢=0 are the points at which the 
linear factors of ¢ vanish. Since two binary linear forms obviously 
vanish at the same point when, and only when, these linear forms are 
proportional, it follows that the loci of the two equations f= 0, ¢=0 
have a point in common when, and only when, f and ¢ have a common 
factor other than a constant. Hence, by § 72, a necessary and suffi- 
cient condition that the two loci f=0, 6=0 have a point in common is 
that the resultant R of the binary forms f, d vanish. 

The property of these two loci having a point in common is, 
however, a projective property. Thus, by Theorem 4, § 81, 


THEeorEeM 1. The resultant of two binary forms is a homogeneous 
invariant of this pair of forms. 
From the determinant form of R given in § 68 it is clear that R 
is of degree m in the a’s and of degree m in the 6’s. Hence by 
ae ee. 
formula (2), § 81, eee 
THEOREM 2. The weight of the resultant of two binary forms of 
degrees m and n is mn. 


The following geometrical problem will lead us to an important 
invariant of a single binary form. 

Let us resolve the form f, which we assume not to be identicaily 
zero, into its linear factors (cf. formula (4), § 65), 

S(®yy Uy) = (My — Hq) (gy — ULHy) +++ (OHH — OL, y). 

The equation f= 0 represents n distinct points provided no two of 
these linear factors are proportional to each other. If, however, two 
of these factors are proportional, we say that f has a multiple linear 
factor, and in this case two or more of the n points represented by 


the equation f= 0 coincide. Let us inquire under what conditions 
this will occur. 
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Form the aa derivatives : 


= af (alr> ae - (ofr, — a},29) 


ott (oef/ay — ma) a OL) +++ (Oy — O,2o) 


Gy |, to + alla, ery) ~- (alsa hat, 
ae ae oe Cas ah (alla, — a, ZL) 


a al, (at) £4 — wy) (03 24 — O50) + (ee — a ) 


SS gg al, (ax, — aay) + > (oll yay — 0, 4%). 


From these formule we see that any multiple linear factor of 
J is a factor of both of these partial derivatives. 

Conversely, if these partial derivatives have a common linear 
factor, it must be a factor of f on account of the formula, 


OF, 
Ly = nf, 
laa Ly a" 





on es 


a formula which follows immediately from the expressions, 


if 

jn Nae 1+ (n— Lary 2a + ++ + 4,125 4, 
1 

af 

OX, 


(2) 


= act 14 2a x7 22, + ++ + nag! 


But, by (1), no linear factor of f can be a factor of 0f/dx, unless 
it is a multiple factor of f. Thus we have proved 


THEOREM 3. A necessary and sufficient condition that f have a 
multiple linear factor is that the resultant of 0f/dr, and Of/dx, vanish. 


Derrnition. The resultant of 0f/dx, and df/dzx, is called the dis- 
criminant of f. 


From (2) we see that the discriminant of f may be written as a 
determinant of order 2m — 2 whose elements, so far as they are not 
zero, are numerical multiples of the coefficients a), a,, --- a, of f. 
That is, this discriminant is a polynomial in the a’s. Moreover, its 
vanishing gives a necessary and sufficient condition that the locus 
f=0 have a projective property (namely, that two points of this 
locus coincide). Hence, by Theorem 4, § 81, this discriminant is a 


* This is merely Euler’s Theorem for Homogeneous Functions, 
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homogeneous invariant, whose degree and weight are readily deter- 
mined. Thus we get the theorem : 


THrorem 4. The discriminant of a binary form of the nth 
degree is a homogeneous invariant of this form of degree 2(n —1) and 
of weight n(n — 1). 

A slight modification in the definition of the discriminant is often 
desirable. Let us write the binary form jf, not in the above form 
where the coefficients are a, @,-:-4,, but, by the introduction of 
binomial coefficients, in the form : 


y nn—1 ~ = 
F(@y) Lq) = Ur + N42 y+ MDD aga Ge fh + NAy_1 2x5 1+4,29. 


Then we may write 


1a rid au n—1)(n—2) 9 
. af =ayrp 1+ (m—1)ayey 22, + ( x agey 8x3 + 
: : see On Las 


or ae = (n—1)(o— 2) A 
ros = 0,07 14+ (n— l)agat a, + us Agty 822 +- >-- + Anty 1. 


We may then define the discriminant of f as the resultant of the two 
binary forms just written. We thus get for the discriminant a 
polynomial in the a’s which differs from the discriminant as above 
defined only by a numerical factor, and for which Theorems 8 and 4 
obviously still hold. If this last detinition be applied to the case of 
a binary quadratic form, it will be seen that it leads us precisely to 
what we called the discriminant of this quadratic form in the earlier 
chapters of this book. 


EXERCISES 


1. Prove that the resultant of two binary forms of degrees n and m respec- 
tively is irreducible. : 

[Sucerstion. When b)=0, RF is equal to a, times the resultant of two binary 
forms of degrees n and m — 1 respectively. Show that if this last resultant is irredu- 
cible, R is also irreducible, and use the method of induction, starting with the case 
(Ne The | 


2. Prove by the methods of this chapter that the bordered determinants of 
Chapter XII are invariants of weight 2. 


_ 8. The following account of Bézout’s method of elimination is sometimes 
given: 
If fand ¢ are polynomials in « which are both of degree n, the expression 


SPY) — P@)SY) 


INTEGRAL RATIONAL INVARIANTS 239 


vanishes, independently of y, for a value of x for which both f and ¢ vanish, and 
is divisible by x — y, since it is zero for z=y. _ Hence 


Fe) LOED— HOLD 





is a polynomial of degree(n—1) in x which vanishes for all values of y when z is 
a common root of f and @. Arranging F according to the powers of y, we have 
the expression 
FH qq +/€y,% + Cogt*+ ++ + Cp, n-1 B} 

+ Yy(Cy9 + Cyt + Cy%? +o + Cy n-12"7}) 

+y * (Cent soe Cop? + vee + oe ue 

+ . . 
ae ge Hee lot Cat, 1% + Cn-1, gt” as Se Cpanel Lae 


If this function is to vanish independently of y, the coefficient of each power 
of y must be zero. This gives n equations between which we can eliminate the n 
quantities, 1 x, z?, .-. z"-1, obtaining the resultant in the form of the determinant, 


Coo =o? C0, m= 1 


Cio Elm 


eee Neches iets =k oe | SSO 


€n-1,9 Cn—1, 1 *** Cn=1, n=1 


With the help of the auxiliary function F we have, in this case, reduced the 
resultant to a determinant of the nth order, while that obtained by the method of 
Sylvester was of order 2 n. 

Criticise this treatment and make it rigorous, applying it, in particular, to the 
case of homogeneous variables. 


4. If fand ¢ are polynomials in (2, y) of degrees n and m respectively and 
are relatively prime, prove that the curves f= 0, @ =0 cannot have more than mn 
points of intersection. 

[Sucexrstion. Show first that the coordinate axes can be turned in such a way that 
no two points of intersection have the same abscissa, and that the equations of the two 
curves are of degrees n and m respectively, after the transformation, in y alone. Then 
eliminate y between the two equations by Sylvester’s dyalitic method. ] 


5. Prove that every integral rational invariant of the binary cubic is a con- 
stant multiple of a power of the discriminant. 

[Sucerstion. Show that if the discriminant is not zero, every binary cubic can 
be reduced by a non-singular linear transformation to the normal form «}? — x3. Then 
as in §48.] 


CHAPTER XVIII 
SYMMETRIC POLYNOMIALS 


83. Fundamental Conceptions. 2% and $ Functions. 

Derrnition 1. A polynomial F(2,, +--+ x,) is said to be symmetric 
of it is unchanged by any interchange of the variables (2, +++ Ln). 

It is not necessary, however, to consider all the possible permuta- 
tions of the variables in order to show that a polynomial is sym- 
metric. If we can show that it is unchanged by the interchange of 
every pair of the variables, this is sufficient, for any arrangement 
(gq Vy +++ Le) may be obtained from (a, x, ---x,) by interchanging 
the z’s in pairs. Thus, if a+ 1, interchange x, with 2,; then in- 
terchange the second letter in the arrangement thus obtained with 
2%; and soon. Hence we have the following theorem: 


THEOREM 1. A necessary and sufficient condition for a poly- 
nomial to be symmetric is that it be unchanged by every interchange of 
two variables. 4 

A special class of symmetric polynomials of much importance are 
the =-functions, defined as follows: 


DEFINITION 2. & before any term means the sum of this term and 
of all the similar ones obtained from it by interchanging the subscripts. 
Thus, for example, 
Dap Sap tage + 24, 
Lajas = via§ + aja. + mi 
+ agah + afek + + abet (a#8) 
+ Gqeh + wey + + + AER4y 
Lajas = vies + mga + +++ + ane 
+ @org + ++: + %5aq 
4 ae 
+ Ln_-4Tn- 
It is clearly immaterial in what order the exponents a, §,--- are 
written. Thus, 2xjafox = Sabaree. 


1%2 
240 


- 
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If we consider any term of a symmetric polynomial, it is evident 
that the polynomial must contain all the terms obtained from this 
one by interchanging the z’s. This aggregate of terms is merely a 
constant multiple of one of the >’s just defined. In the same way it 


| is clear that all the other terms of the symmetric polynomial must 


arrange themselves in groups each of which is a constant multiple 


Sola>d. That is, 


THEOREM 2. very symmetric polynomial is a linear combination 
with constant coefficients of a certain number of X's. 


Among these ’s the simplest are the sums of powers of the 2’s. 
For the sake of brevity the notation is used: 
S,=taraaer+ok+-- +ak (kh =I, 2, ---): 


It is sometimes convenient to write S) =n. 


THEOREM 8. <Any symmetric polynomial in the xs can be ex- 
pressed as a polynomial in a certain number of the S’s. 

Since every symmetric polynomial is a linear combination of a 
certain number of 2’s, in order to prove our theorem we have only 
to show that every = can be expressed as a polynomial in the S’s. 
Now Sa paet +2, 

S,=rh+apt--- +2. 
Hence, if «+ B, 
Sug = xttP + aptB +... + antP + atod + afah + + 
= ay: atB + > ark 

From this we get the formula: 

(1) Lapa, = SiSs— Sore (a+ 8). 

If «= 8, we have 

S? = 7% 4+ 02 4. +02" + Qatas + Qatag+- 
= So, + 2 Dxjix}. 
Hence 
(2) > wine =4(S?2 — S,.). 

Similarly, by multiplying 2a2{2§ by S|, we get the following 
formule where the three integers a, 8, y are supposed to be distinct: 
(3) ZS wtrhay = SS_S, — Sarpy — Sa+ySp — Sery Sa + 2Sarp+y 
(4) Laie = 2 1 (S28, — SyaS,— 2 Sary Sa + 2Soa+y)) 

(5) S wtagas = 4(S3 — 38,8, + 285,). 


R 
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The proof indicated in these two special cases may be extended 
to the general case as follows: 

If we multiply together the two symmetric polynomials 
6) Sateg a 8, = Eat, (k<n) 
we get terms of various sorts which are readily seen to be all con- 
tained in one or the other of the following polynomials, each of these 
polynomials being actually represented : 
(7) Dass we 2, Daye vee GR, sense Se ong, Lan --» Gre. 
Consequently, since the product of the two polynomials (6) is sym- 
metric, it must have the form 

ay a ++ ME Cy Daag --- waE+ Séccaece + ¢, Lxtxh ooo pee™ 
Puls F Cy4 EA2Q + UN 

where ¢,, --- ¢,,; are positive integers. 

Transposing, we may write 


1 
Safes --- ch, =—[Zajeh --- af - Eat—eSapeh ... af 
K+ 3 


= pea rae lay Goh haan Mareen asta eke rB ... gtr 
Cy 2UTS r eC, Darth... aetr], 


Hence, if our theorem is true for =2z{ --- 2, it is also true for 
Lat --- zt,,. But we know it is true for k=1 (by definition of the 
S’s), hence it is true for k= 2, hence for k=3, and so on. Thus 
our theorem is completely proved. 


84. Elementary Symmetric Functions. The notation Latz8 --- 2} 
may be used to represent any = in » variables. If B=qy=-- = 
v = 0, this becomes 22 or S,; if y=---=v=0, it becomes Lartak 5 
and so on. 

Let us now consider =afz§ --- 2% where a, A, --- v, are all 0 or 1. 
The following m cases arise: 


a=1, B=y= + =v =9, 22, 

a= §=1, yo. =p=0, XXL, 

a=B=.-= =1, v=0, 222, 2228 Ln) 

e=B=---=v=l1, UyLq ++ Lye 
The extreme case e=8B=---=v=0 igs of no interest. We will 


represent these n symmetric polynomials by py, py, «++ Pay Tespectively. 
They are called the elementary symmetric functions. 
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THEOREM 1. Any symmetric polynomial in the 2s may be ea- 
pressed as a polynomial in the p’s. 


Since any symmetri¢ polynomial in the 2’s may be expressed as a 
polynomial in the S’s, it is sufficient to show that every S may be 
expressed as a polynomial in the p’s. 

Let us introduce a new variable xz and consider the polynomial 

F (G5 Ly Uqy +++ Ly) = (w— 2% — Ly) ++ (— ap) 
=x" — pv" 14 pe”? — --» +(—1)"Pn 


Using the factored form of f, we may write 


er ee ie os 


Or @—-% CX . w— ay 


Since f vanishes identically when 2 =, we may write 


f(a" —at)— pat — at) + 
Accordingly, . 


= Sat + (Gi — pert (23 — py t+ Pat? 3 + ++, 


Z— 2; 
ie = nx" + (iS; — mp, a"? + (S_ — pS, + mpy)a" 8 + -- 


On the other hand, we have 


as = na" — (n—1)p,a""2 + (n — 2) pa"-8 — 
a 


Hence, equating the coefficients of like powers of a in these two 
expressions, we have 


S,— np,=—(n—1)p, 
S) eee a ie = im = ”\Pa 
8. — PSs ngs Pee 1)1np,-1 =(— 1)" pa_p 


or 


S,—p,=9, 
(1) J Ss Set are 
8. ~ PSr-s + 7iSe4 = -+(—1)"(n—1)pn1 =9 


Now consider the identities 
a? — pat + pare —..+(—1)p,=90 (¢=1, 2,-- 2). 
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Multiplying these identities by v-”, --- ai” respectively and adding 

the results, we have 

(2) Sy — 2S t PoSr-2— aa +(—1)"p,S,1=9 (k=n, n+ iP ve), 
Formule (1) and (2) are known as Newton’s Formule. By 


means of them we can compute in succession the values of Sj, 8, -- 
as polynomials in the p’s: 


S,=Pp 
Sy = p? — 2po, 
Ss = pi — 8p Po + 8p 35 


Thus our theorem is proved. 

It will be noted that Newton’s formule (1) cannot be obtained 
from (2) by giving to & values less than n. The necessity for two 
different sets of formule may, however, be avoided by introducing 
the notation Vi =a = (NE 


Then all of Newton’s formule may be included in the following form: 
(4) S,—PSi-a to +(— DP pS +(—1'kp,=0 (k= 1, 2,--). 


Using this notation, we see that the explicit formule (8) for 
expressing the S’s in terms of the p’s are.wholly independent of the 
number of the 2’s. 

Since the formule referred to in the last section for expressing the 
>’s in terms of the S’s are also independent of », we have established 

THEOREM 2. Jf we introduce the notation py. = Pnig = ++: =9 and 
use Newton's Formule in the form (4), the formula for expressing 
any 2 as a polynomial in the p's is independent of the number n of the x’s. 

When we have & polynomials in 7 variables 


FA eas ind hati, see z). + fi( 21, “ae Dn)y 
we say that there exists a rational relation between them when, and 
only when, a polynomial in & variables 
F(A, + &) 


exists which is not identically zero, but which becomes identically 
zero as a polynomial in the zs when each zg is replaced by the 


corresponding /f, Ff, $=. 
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THEOREM 3. There exists no rational relation between the elemen- 
tary symmetric functions in n variables py, +++ Py: 


For let F(2,, --- z,) be any polynomial in n variables which is not 
identically zero, and let (aj, --- a,) be a point at which this polynomial 
does not vanish. Determine (2, --- z,) as the roots of the equation 


x” — av 14 do == 002 Ge (— })°a; =0. 


For these values of the 2’s, the p’s have the values aj, ---a,, and 
therefore F(p,,---p,) does not vanish for these 2’s, and is con- 
sequently not identically zero as a polynomial in the zs. Thus 
our theorem is proved. 

CoroLLARY. . There is only one way in which a symmetric poly- 
nomial in (24, +++ %) can be expressed as a polynomial in the elementary 
symmetric functions Py, °°: Pn 


For if f is a symmetric polynomial, and if we had two expressions 


for it, F(@y ++ &)= (Pp “Dns 
F(@p mee Zn) = Pol Py» 05); 


then by subtracting these identities from one another we should have 
as an identity in the z’s, 


$1(Py Pn) — Px Par *** Pn) = 9- 
This, however, would give us a rational relation between the p’s, 


unless by(245 °** Sn) = Pol 24) °** Sn)- 


Thus we see that the two expressions for f are really the same. 


EXERCISES 


1. Obtain the expressions for the following symmetric polynomials in terms of 
the elementary symmetric functions: 
S222, Sx 323, Sari rar. 
2. Prove that every symmetric polynomial in (z1,++-%,) can be expressed in 
one, and only one, way as a polynomial in Sj, ..- Sy. 


85. The Weights and Degrees of Symmetric Polynomials. We 
will attach to each of the elementary symmetric functions p; a weight 
as . ; 3 
equal to its subscript, cf. § 79. 


THEOREM 1. A homogeneous symmetric polynomial of degree m in 
the xs, when expressed in terms of the p’s, is isobaric of weight m. 
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Let 
(1) r F(X) Lay *** Tn) =P Py Po» *** Pn) 


be such a polynomial. Since p, is a homogeneous polynomial of the 
first degree in the a’s, p, of the second, etc., any term of ¢, when 
written in the 2's, must be a homogeneous polynomial of degree equal 
to the original weight of the term. Thus, for example, the term 
6 p2p.p3 whose weight is 138, when written in the 2’s will be a homo- 
geneous polynomial of degree 13. Accordingly an isobaric group of 
terms when expressed in terms of the 2’s will, since by Theorem 38, 
§ 84, it cannot reduce identically to zero, be homogeneous of the 
same degree as its original weight. If then @ were not isobaric, 
f would not be homogeneous, and our theorem is proved. 


CorotuARy. If f ts non-homogeneous and of the mth degree, > is 
non-isobaric and of weight m. 


THEOREM 2. A symmetric polynomial in (a, --- £,), when written 
in terms of the elementary symmetric functions p1, +++ Pp, will be of the 
same degree in the p’s as it was at first in any one of the x's. 


Let f be the symmetric polynomial, and write 
I(@y ot a Un) =¢(Pr Pa °~ Pn)» 


and suppose f is of degree m in 2, (and therefore, on account of the 
symmetry, in any one of the 2’s), and that ¢ is of degree pw in the p’s. 
We wish to prove that m=yp. Since the p’s are of the first degree 
in z,, it is clear that mSp. 

If @ is non-homogeneous, we can break it up into the sum of a 
number of homogeneous polynomials by grouping together all the 
terms of like degree. Each of these homogeneous polynomials in 
the p’s can be expressed (by substituting for the p’s their values in 
terms of the z’s) as a symmetric polynomial in the 2’s._ If our theo- 
rem were established in the case in which the polynomial in the p’s 
is homogeneous, its truth in the general case would then follow at 
once. 

Let us then assume that ¢ is a homogeneous polynomial. The 
theorem is obviously true when n= 1, since then p,=—2,. It will 
therefore be completely proved by the method of mathematical in- 
duction if, assuming it to hold when the number of 2’s is 1, 2, --- n—1, 
we can prove that it holds when the number of 2’s is n. 
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For this purpose let us first assume that p, is not a factor of every 
term of d. Then $(p,, -+: p,»-1, 0) is not identically zero but is still 
a homogeneous polynomial of degree mw in (p,, --- Py)» Now let 
x,=9. This makes p, = 0, and gives the identity 


(2) F (By +++ Gy = HC pi +> Pav 9); 


where pj,--- pl, are the elementary symmetric functions of 
(X4, +++ Z_y), and f(a, --+ %_;,9) is a symmetric polynomial of 
degree m, in 2,, where m;Sm. From the assumption that our 
theorem holds when the number of z’s is n—1, we infer from (2) 
that #=m,Sm; and since we saw above that mu cannot be less than 
m, we infer that “=m, as was to be proved. 

There remains merely the case to be considered in which p, is a 
factor of every term of ¢. Let p* be the highest power of p, which 
occurs as a factor ing. Then 


(Py -* Pn) =PrPi(Pp *** Pn) 


where ¢, is a polynomial of degree ~—f. Putting in for the p’s 
their values in terms of the z’s, we get 


(3) Is see Ln) = via eee rea ACE eee @e)s 
where 
(4) Fito con Ln) = (Pp “++ Dy) 


From (3) we see that 7, is of degree m —k& in z,, and from (4), since 
¢, does not contain p, as a factor, that the degrees of f, in 2, and 
of ¢, in the p’s are equal 
1 P ae m—-k=p—k. 

From which we see that m= p, as was to be proved. 

The two theorems of this section are not only of theoretical 
importance, they may also be put to the direct practical use of 
facilitating the computation of the values of symmetric polynomials 
in terms of the p’s. 

In order to illustrate this, let us consider the symmetric function 


SF (By +++ Bq) = Taha, r,. 


Since f is homogeneous of the fourth degree in the 2’s, it will, by 
Theorem 1, be isobaric of weight 4 in the p’s. Since it is of the 
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second degree in 2, it will, by Theorem 2, be of the second degree in 
the p’s. Hence 
(5) Lajayv, = Ap, p; + Bp} + Cry 
where A, B, and C are independent of the number n (Theorem 2, 
§ 84), and may be determined by the ordinary method of unde- 
termined coefficients. 
Take n =38,so that p,=0. Letting 2,=0, 2,=2,=1, we have 
D2. ps1, po 0. Substituting these values in (5), we find B=0. 
Letting #,.= —1, 7,=2,=1, we have p, =1, p, = —1, pg = —1, 
which gives A= 1. 
Now let ~=4, 2,0, =2,=2,21, 
From this we find p,=4, 7p,=6, p,=4, p,=—1. 
Substituting this in (5) gives C= —4. Hence 


VT}, = P1 Pz — 4P4- 


EXERCISES 
1. The symmetric function 


S (41 ++ Gn) = Sxirex3 + Sarjxz + Sayroxsr4 


is homogeneous of the fourth degree in the 2’s, and is of the second degree in 21; 
hence, when written in terms of the p’s, it will have the same form, Apips + Bp3 
+ Cps, as the above example. Compute the values of A, B, and C. 


2. If f (x1, v2, 43) = (x1 —X2)?(x1 — %3)*(x2 — 23)”, show that 
S (#1, t2, 23) = — 27 p} — 4p} + 18 pipops — 4 pips + pip. 


86. The Resultant and the Discriminant of Two Polynomials in 
One Variable. Let 
SF (a) =a" + aya") + aga” + + fay 


= (t — ay (x — at) «++ (@— a), 
p(x) =a" + bya” + boa”? + + +8, 
= (e — B(x — By) +» (@— Bn) 
be two polynomials in 2, and consider the product of the mn factors 
(a — Bye — Bg) +++ (& —Byn) 
(1) (4, — By) — Bg) +++ (0 — Bin) 
(a, *~ By en =a By) os (an—Bm) 
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This product vanishes when, and only when, at least one of the a’s 
is equal to one of the #’s. Its vanishing therefore gives a necessary 
and sufficient condition that f and @ have a common factor. More- 
over, the product (1), being a symmetrical polynomial in the «’s and 
also in the §’s, can be expressed as a polynomial in the elementary 
symmetric functions of the «’s and 4’s, and therefore as a polynomial 
in the a’s and 6’s._ ‘This will be still more evident if we notice that 
the product (1) may be written 


$(%4) 6(%y) ++ Pon). 


In this form it is a symmetric polynomial in the «’s whose co- 
efficients are polynomials in the 6’s, and it remains merely to bring 
in the a’s in place of the a’s. 

We thus see that the product (1) may be expressed as a poly- 
nomial F'(a,, --- @,; 64, --+ 6,,) in the a’s and 6’s whose vanishing gives 
a necessary and sufficient condition that f and ¢ have a common 
factor. In §68 we also found a polynomial in the a’s and 6’s whose 
vanishing gives a necessary and sufficient condition that f and ¢ 
have a common factor, namely the resultant R of f and @. 

We will now identify these two polynomials by means of the 
following theorem : 


‘THEOREM 1. The product (1) differs from the resultant R of f 
and only by a constant factor, and the resultant is an irreducible 
polynomial in the a’s and 6's. 

In order to prove this theorem we will first show that this prod- 
uct (1), which we will call F(a, ---a,3; 64, -+-5,), is irreducible. 
This may be done as follows: Suppose F is reducible, and let 
F (a, 8 An 3 b,, SOF bn) = LP (a4, ne yf b,, Rate bm) Pay, + Ay by, aitis Dn) 
where fF, and F, are polynomials neither of which is a constant. 
Then, since the a’s and 0’s are symmetric polynomials in the «’s and 
B’s, F, and F,, may be expressed as symmetric polynomials ¢, and ¢, 
in the @’s and #’s, and we may write 


(4; Fae) (aap By nite Bm) fol 045 8+ On 3 By» PEC Ba) 
(%, — By) (% — By) +++ (a — Bm) 
(@ — By) (% — Bo) ++: (4 — Bm) 


ee ea ee, — 8. 
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The factors on the right-hand side of this identity being irreducible, 
we see that ¢, must be composed of some of these binomial factors 
and ¢, of the others. This, however, is impossible, since neither ¢, 
nor ¢, would be symmetric. Hence /’ is irreducible. 

Now, since /’= 0 is a necessary and sufficient condition for f(z) 
and ¢(7) to have a common factor, and R=0 is the same, any set of 
values of the a’s and 6’s which make #=0 willalsomake R=0. Hence 
by the theorem for n + m variables analogous to Theorem T, § 76, / is 
a factor of R. Also, since F is a symmetric polynomial in the @’s 
and §’s of degree m in each of the @’s and m in each of the f’s, by 
Theorem 2, § 85, it must be of degree m in the a’s and n in the 0’s. 
But # is of degree not greater than m in the a’s and n in the 0’s, as-is 
at once obvious from a glance at the determinant in § 68. Hence F, 
being a factor of A, and of degree not lower than #, can differ from 
it only by a constant factor. Thus our theorem is proved. 

Let us turn now to the question: Under what conditions does 
the polynomial f(x) have a multiple linear factor? Using the same 
notation as above, we see that the vanishing of the product 


{oy — Gy) (cy — Gg) +++ (4 — Oy) 


(@,-4 pa Oy 


is a necessary and sufficient condition for this. P is not symmetric 
in the @’s, since an interchange of two subscripts changes P into — P. 
If, however, we consider P? in place of P, we have a symmetric 
polynominal, which can therefore be expressed as a polynomial in 
the a’s, 


[P (a4, +++ &,) ]? = F(a, +++ ay) 


Moreover, = 0 is also a necessary and sufficient condition that f(x) 
have a multiple linear factor. 

On the other hand, it is easily seen that f(z) has a multiple linear 
factor when and only when f(x) and f!(2) have a common linear 
factor. A necessary and sufficient condition for f(x) to have a mul- 
tiple linear factor is therefore the vanishing of the resultant of f(«) 
and f'(v). This resultant we will call the discriminant A of f(z). 
It is obviously a polynomial in the coefficients of f. 
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THEOREM 2. The polynomials F and A differ only by a constant 
factor, and are irreducible. 


The proof of this theorem is similar to the proof of Theorem 1, 
and is left to the reader. 


EXERCISES 
1. Compute by the use of symmetric functions the product (1) for the two 
polynomials ab axe Gs, 
a? + bye Ht ba, 
and compare the result with the resultant obtained in determinant form. 


2. Verify Theorem 2-by comparing the result of Exercise 2, § 85, with the 
discriminant in determinant form of the polynomial 


x? + a,x? + Agu + az. 


CHAPTER XIX 
POLYNOMIALS SYMMETRIC IN PAIRS OF VARIABLES 


87. Fundamental Conceptions. % andS$Functions. The variables 
(21, +++ £,) which we used in the last chapter may be regarded, if we 
wish, not as the codrdinates of a point in space of n dimensions, but 
rather as the codrdinates of m points on a line. In fact this is the 
interpretation which is naturally suggested to us by the ordinary 
applications of the theory of symmetric functions (cf. §86). Looked 
at from this point of view, it is natural to generalize the conception 
of symmetric functions by considering m points in a plane, 


(1) (1) Y1)> ®a» Yas“ Env Yn)» 
DEFINITION. <A polynomial, 


Flay) Yy 3 Vay Yas °° Lar Yn) 


in the codrdinates of the points (1) is said to be a symmetric polynomial 
in these pairs of variables if it is unchanged by every interchange of 
these pairs of variables. 


As in the case of points on a line, we see that it is not necessary 
to consider all the possible permutations of the subscripts in order to 
show that a polynomial #’ is symmetric. It is sufficient to show that 
F is unchanged by the interchange of every pair of the points (1). 

We will introduce the = notation here precisely as in the case of 
single variables. Thus, for example, 


Lapyy Sepyy bagys + + + aeyh 


Can Diaite nr e etna Canie 24:9 ;B 1 Fen rE 
VOPY LLY y= py eyYy + epyprpyy? + os 


and so on. 
As in the case of single variables, it is clear that the order in 
which the pairs of exponents «,, 8;; @, 8,3; ... are written is imma- 


terial; aud also that every symmetric polynomial in the pairs of variables 
(1) ts a linear combination of a certain number of X's. 
252 
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We introduce the notation 
Fees al oe 
SusEaiy, sei teint tek (Gog yf 
THEOREM. Any symmetric polynomial F(a, Y1 3 +** Zn Yn) may 
_ be expressed as a polynomial in these S’s. 


The proof of this theorem is exactly like that of Theorem 3, § 83, 
and is left to the reader. 


88. Elementary Symmetric Functions of Pairs of Variables. 
Every = function of » pairs of variables may, by giving to the a’s 
and #’s suitable values, be written in the form 


(1) Sarybagyh anys 


DEFINITION. The function (1) ts said to be an elementary sym- 
metric function of the pairs of variables (a, Y1)) -** (@ns Yn) when, and 


only when, oe ara Cae aye 
but not all the «s and fs are zero. 


We shall adopt the following notation for these elementary sym- 
metric functions: Py ==tp P= ZI p 


Po = = UXyp Py == XY Po. = = WY 
Dg =U 4_ °° Bm °° * Di, ni = ZL VY Yn 8 = Don = Yio 7 Yas 


It is clear that there are a finite number, 3 n (n+ 3), of p,’s, but 
an infinite number of S’,’s. 

We will attach to each pa weight with regard to the x’s equal to its 
first subscript and a weight with regard to the y’s equal to its second 
subscript. When we speak simply of the weight of p,, we will mean 
its total weight, that is, the sum of its subscripts. 


THEOREM. Any symmetric polynomial F(x, Y13 ++ ny Yn) may be 
expressed as a polynomial in the pj's. 


Since, by the theorem in § 87, any such polynomial may be 
expressed as a polynomial in the S,,’s, it is sufficient to show that 
the S’,’s may be expressed as polynomials in the p,’s. 


Let 2) =AX,+ My» E,=Ax, + MY °° E = At, + HY nr 
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and form the elementary symmetric functions of these &’s : 
1 SDE SADA, + MDY, = AP + HPow 
Ty = DEE, = U(Aa, + MYy) (AX + HY) 
SND a ty + APTLY, + MZ Ye 
= Poy + AMP + FH? Pow» 
Ts = EEE, bs SM pgy + MPa + AM Pg + HPops 


Ty = Ey Eyer E, = Dng + ATM Png, + Pog + 0 + "Pon. 
Also let o,= > EF (4 =1, 2,%c0) 
Let « and £8 be positive integers, or zero, but not both zero. 

Then Cup = > eas = Ae Sa zs AHH Ty St aet Bly, HE boc 

=r? 8.8.9 ae EEE BS eee SB ocd 
But by Theorem 1, § 84, we may write 
Cap = E(t Toy ++ Tn)s 
where # is a polynomial. Hence 
AES c4p,0 ATP Soe 4,1 + =U (Pros ++ Pon Av H)s 


where VY is a polynomial. Regarding this.as an identity in (A, ») 
and equating the coefficients of the terms containing A*4#*, we get an 
identity in the 2’s and y’s, 


Sap = B (Pio *** Pon)» 


where ® is a polynomial in the p’s. Thus our theorem is proved. 

Theorem 8, § 84, does not hold in the case of pairs of variables, 
as relations between the 4n(n+8) p,’s do exist; for example, if 
n= 2, the polynomial 


4 Doo Poo — Pao Pt — P2o Poa + PiPuPo — Pia 


vanishes identically when the p’s are replaced by their values in 
terms of the a’s. It does not vanish identically when n= 3. 

In view of the remark just made, it is clear that the represen- 
tations of polynomials in pairs of variables in terms of the p,,’s will 
not be unique. 

For further information concerning the subjects treated in this 
section, the reader may consult Netto’s Algebra, Vol. 2, p. 63. 


POLYNOMIALS SYMMETRIC IN PAIRS OF VARIABLES 255 


EXERCISES 


1. Prove that a polynomial symmetric in the pairs of variables (x, y;) and 
which is homogeneous in thé 2’s alone of degree n and in the y’s alone of degree m 
can be expressed as a polynomial in the p,’s isobaric of weight n with regard to 
the z’s, and m with regard to the y’s. 

2. Express the symmetric polynomial 

= xiyoys 
in terms of the p;’s by the method of undetermined coefficients, making use of 
Exercise 1. 

3. A polynomial in (%1, 91, 213 2, Y2; 225 +++ Lay Yn» Zn) Which is unchanged by 
every interchange of the subscripts is called a symmetric polynomial in the n 
points (xj, Yj, 2). 

Extend the results of this section and the last to polynomials of this sort. 


89. Binary Symmetric Functions. The pairs of variables (2,, y,), 
*- (px Yn) may be regarded as the homogeneous codrdinates of n 
points on a line as well as the non-homogeneous codrdinates of n 
pointsinaplane. It will then be natural to consider only sym- 
metric polynomials which are homogeneous in each pair of variables 
alone. Such polynomials we will call binary symmetric functions. 
Most of the p,’s of the last section are thus excluded. The last 
m+1 of them (Ppyos Pn-1,°** Pon)» however, are homogeneous of 
the first degree in each pair of variables alone. We will call them 
the elementary binary symmetric Functions. 

THEOREM 1. Any binary symmetric function in (a4, Y13 -** Lny Yn) 
can be expressed as a polynomial in (Pros Pn—1,19 *** Pon): 

If we break up our binary symmetric function into 2s, it is clear 
that each of these >’s will itself be a binary symmetric function, or, 
as we will say for brevity, a binary 2. It is therefore sufficient to 
prove that our theorem is true for every binary 2. The general 
binary = may be written 


Zapypiag yy? Taryn” — (ty Sg + SAy), 
where, if we denote by m the degree of this = in any one of the pairs 
of variables, en eee = 0s 4-By 

Let us assume for the moment that none of the y’s are zero, and let 


ae ek tenet, Xe 
t yy fem ne an 
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Now consider the elementary symmetric functions of these X’s: 


P,= =X, = Pim, 
Pon 

Pi => X,X, = Po, n-2, 
Pon 


P, = XX,» X, =P. 
We may write Pon 





>a figs 2 2. Gn g7Bn ss 
(1) 141 a wn = XaXn ... X= O(P,, --- P,), 

on 
where, since we have assumed «, 20,2 --- Ze, ® is a polynomial 


of degree a, in the P’s (Theorem 2, § 85). Hence we may write 





(2) OCP ae y= P( Pon Pr, seit a Pro)», 
on 
where ¢ is a homogeneous polynomial of degree «. 


We thus get from (1) and (2) 


(3) ZS apyyt + LYn” = PorP( Pons P1sn—a» *** Pro)» 

an equation which holds except when one of the y’s is zero. Since 
each side of (3) can be regarded as a polynomial in the 2’s and y’s, 
we infer, by Theorem 5, § 2, that this is an identity, and our theorem 
is proved. 

By Theorem 1, § 85, ® is isobaric of pete Oy + Gy + «+ +O, in 
the P’s. Hence = apy? --- xi"yk", when expressed in terms of these 
(n+1) p,,/’s, is isobaric of weight «, + a,+ --- +@,, provided we count 
the weight of the p,’s with regard to the 2’s. Passing back now to 
an aggregate of a number of such >’s, we get 


THEOREM 2. Jf a binary symmetric function is homogeneous in 
thenw’s (or y’s) of degree k, it will, when expressed in terms of 
Pro» Pn-1,19 *** Pon be isobaric of weight k with regard to the x’s (or y’s). 


We have seen in the proof of Theorem 1 that the polynomial ¢ in 
(3) is a homogeneous polynomial of degree @, in the p’s; so that 
day," --- vm yf" is a homogeneous polynomial of degree a,+ 
=m in the p’s. Hence 


THEOREM 8. Any binary symmetric function of degree m in each 
pair of variables will, when written in terms of Pros Pn—1,1. *** Pon 0¢ & 
homogeneous polynomial of degree m in these p’s. 


POLYNOMIALS SYMMETRIC IN PAIRS OF VARIABLES 257 


EXERCISES 


1. Prove that no rational relation exists between pno, -++ Pon, and hence that a 


binary symmetric function can be expressed as a polynomial in them in only one 
way. 


2. By a ternary symmetric function is meant a symmetric polynomial in n 
points (x; ¥;, 2:) which is homogeneous in the coérdinates of each point. 

Extend the results of this osu oe to ternary symmetric functions. Cf. Exer- 
cise 3, § 88. 


90. Resultants and Discriminants of Binary Forms. It is the 
object of the present section to show how the subject of the re- 
sultants and discriminants of binary forms may be approached from 
the point of view of symmetric functions. 


Let Hey m) = aye + afl + --- + a0? 
= (Gia, — Oty (HZ Ty — OLy) ++ (Ont — La), 
Ply %q) = bay + bya My ov byw! 
= (Bix, in Bix, X Box, — B5%q) * (Bre — Aiwa 
be two binary forms. Each of these polynomials has here been 
written first in the unfactored and secondly in the factored form. 


By a comparison of these two forms we see at once that the elemen- 
tary binary symmetric fractions of the n points 


(ah, 1), (eta, Of) +> (hy On) 
are Ags — My; Pe G24 
and of the m points ({}, 6{').(8% By), --- Bn» Bin 
are Bos — B45 By, «++ (— 1)™bne 


Let us now consider the two linear factors 
CL, — Wy, Bila, — Bite. 
A necessary and sufficient condition for these factors to be propor- 
tional is that the determinant 


a8; — GBF 
vanish. Let us form the eae a all such determinants : 
(«lft — a) (WB — 81!) --- (allBt, — oh8f) 
pe | (B= BL) ($B — eB) (A, — 0,84 


Cis 7" Sle 8") (ell ae ad th mee 1 (atl i = ay ") 
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The vanishing of this product is a necessary and sufficient con- 
dition that at least one of the linear factors of f be proportional to 
one of the linear factors of ¢, that is, that fand ¢ have a common 
factor which is not a constant. 

We may obviously reduce P to the simple form 

P =f (By BY) S( Bx» Bz) + FB Bin): 

In this form it appears as a homogeneous polynomial of the mth 
degree in the a’s, and as a symmetric polynomial in the m points 
(8!, Bi’). Moreover, it is obviously a binary symmetric function which 
is of the nth degree in the codrdinates of each point. Consequently, 
by Theorem 3, $89, it can be expressed as a homogeneous polynomial 
of the nth degree in the elementary binary symmetric functions of 
the points (8, 8’), that is, in the 0’s. Thus we have shown that the 
product P can be expressed as a polynomial in the a’s and b’s which is 
homogeneous in the a’s of degree m and in the b’s of degree n. 

In § 72 we found another polynomial in the a’s and 08’s, whose 
vanishing also gives a necessary and sufficient condition for fand @ 
to have a common factor, namely, the resultant R. We will now 
identify these polynomials by means of the following theorem : 


THEOREM 1. The product P differs from the resultant R of f and 
gd only by a constant factor, and the resultant is an irreducible poly- 
nomial in the a’s and b's. 

We may show, in exactly the same way as in the proof of Theo- 
rem 1, § 86, that P, when expressed as a polynomial in the a’s and 
b’s, is irreducible. Since P=0 and K=0 each give a necessary and 
sufficient condition for f and ¢ to have a common factor, any set of 
values of the a’s and 6’s which make P=0 will also make R=0. 
Thus by Theorem 7, § 76, P is a factor of R. We have seen that P 
is of degree m in the a’s and n in the 6’s. The same is also true of 
R, as may easily be seen by inspection of the determinant of § 68. 
Hence, P being a factor of R, and of the same degree, can differ 
from it only by a constant factor. Thus our theorem is proved. 

Let us now inquire under what conditions the binary form f(2,, 24) 
has a multiple linear factor. Using the same notation as above, we 
see that the vanishing of the Bass 

(ek — ee ee — el) «Col ~ el 
(aif — ae) (ale — eet) = Cal, as» yal) 


(ai! abe eae Ge se 
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is a necessary and sufficient condition for this. P, is not symmetric 
in the pairs of a’s, since an interchange of two subscripts changes P, 
into —P,. If, however, we consider P? instead of P,, we have a 
binary symmetric function which can be expressed as a polynomial 
in the a’s 
[Pla als --- aly ally P= Fay + ,) 

Moreover, F vanishes when, and only when, P, does. Accordingly 
F=0 is a necessary and sufficient condition for f(x, x,) to have a 
multiple linear factor. 

But the vanishing of the discriminant A (cf. § 82) of f(2,, xq) 
is also a necessary and sufficient condition for this. 

THEOREM 2. F and A differ only by a constant factor, and are 
irreducible. 

The proof of this theorem, which is practically the same as that 
of Theorem 1, is left to the reader. 

If we subject the two binary forms f and ¢, which we may sup- 
pose written in the factored form, to the linear transformation 
(1) Ly = C42 + Cyy%y 

Ly = Cy4X} + Cop, 
we get two new binary forms 
(Al'e, — Alal)( Af — All) (Alla! — Ala), 
(Bix, — Bye) Bya — Byry) (Bua, — Bre), 


" I! / ps I J 
where A=: oft, — a1 Bi = 5 C41 — Blogs 
/ A hes I ie eS // / 
Al, = — A Cig + Hilg95 B} =— B; Cio + Rice 
so that Aj B; — A,B] = e( af! 8; — “i87), 


where ¢ is the determinant of the transformation (1). 

Since the linear transformation (1) may be regarded as carrying 
over thea’s and §’s into the A’s and B's, the last written identity 
shows us that «/8}—«/@)' is, in a certain sense, an invariant of 
weight 1. It can, however, not be expressed rationally in terms of 
the as and 6’s._ Such an expression is called an irrational invariant. 

Since the resultant of f and ¢ is the product of mn such irra- 
tional invariants of weight 1, it is evident that the resultant itself is 
an invariant of weight mn. Thus we get a new proof of this fact, 
independent of the proof given in § 82. 

A similar proof can be used in the case of the discriminant of a 


binary form. 
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EXERCISES 
Develop the theory of the invariants of the binary biquadratic 
SF (ty %q) =aget + 4 a, rx, + 6 avin + 4 2,08 + a,x$ 
= (oul!x, — Otfary) (44/2, — Myary) (Ua, — Oehrg) (lx, — RZ) 
along the following lines: 


1. Start from the irrational invariants of weight 2, 
A = (alee, — yee) (alo, — eee), 
B= (ct! 045 — oct) ("404 — lly!) , 
C= (ory! oe — cehory') (Oey ces — e230), 
whose sum is zero, and the negatives of whose ratios are the cross-ratios ofthe 
four points (04, 0'), (l, as’), (05, oes! ), (04, 04!) 
2. Form the further irrational invariants of weight 2 
E,x=B—C, E,=C—A, E3;=A—B; 
and prove that every homogeneous symmetric polynomial in E1, E2, E3 is a 
binary symmetric function of the four points (qj, «;!), and therefore an integral 
rational invariant of f- 
3. In particular 
Go= EE, + E2E3 + Esky, G3= E,E2E3 
are homogeneous integral rational invariants of weights 4 and 6, and of degrees 2 
and 8 respectively. Prove that 
Go= — 3692, G'3=482 9s, 
where go=aoas —4 a1a3+4 8 a, 
9340204 + 2 a1a2a3 — a9a3 — a}aa — a3. 


These expressions gz and gs3 are the simplest invariants of f.* 


4. Prove that the discriminant A of fis given by the formula 
‘ A=q3 — 2793. 


5. If A=0, prove that g3 = 0 is a necessary and sufficient condition that the 
four points f=0 form a harmonic range; and that g,.=0 is a necessary and 
sufficient condition that they form an equianharmonic range. (Cf. Exercise 3, 


§ 33.) 


6. Prove that g2=93=0 is a necessary and sufficient condition that f have 
at least a threefold linear factor. 


* They are among the oldest examples of invariants, having been found by Cayley 
and Boole in 1845. 

t Notice that we here have a projective property of the locus f=0 expressed by 
the vanishing of two integral rational invariants; cf. the closing paragraph of § 81. 
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7. If X is the absolute irrational invariant 


A 
h=-5 


i.e. one of the cross-ratios of the points f=0, prove that the absolute rational 
invariant 8 
[=2 
A 
oie SOS sel 
S27 (N= 1)PAS 
8. Prove that a necessary and sufficient condition for the equivalence of two 
biquadratic binary forms neither of whose discriminants is zero is that the inva- 
riant J have the same value for the two forms. 


can be expressed in the form 


9. Prove that a necessary and sufficient condition for the equivalence with 
regard to linear transformations with determinant + 1 of two biquadratic binary 
forms for which gz and gs are both different from zero is that the values of gp 
and gs be the same for one form as for the other. 


10. Prove that if the discriminant of a biquadratic binary form is not zero, the 
form can be reduced by means of a linear transformation of determinant + 1 to the 
normal form 4 abe, — gorird — gah 

11. Prove that every integral rational invariant of a biquadratic binary form 
is a polynomial in g2 and gs. 





12. Develop the theory of the invariants of a pair of binary quadratic forms 
along the same lines as those just sketched for a single biquadratic form. 


13. Prove that every integral rational invariant of a pair of quadratic forms 
in n variables is an integral rational function of the invariants @p, --. @, of § 57. 


[Suecxrstion. Show first that, provided a certain integral rational function of 
the coefficients of the quadratic form does not vanish, there exists a linear transfor- 
mation of determinant + 1 which reduces the pair of forms to 


O02 + Oo? foivee Oy 
Bix; + Borg ++ + B,x?. 


Then show that every integral rational invariant of the pair of quadratic forms can be 
expressed as a binary symmetric function of (01, 81), (2, Bz), +++ (On, Bn), and that 
the @’s are precisely the elementary binary symmetric functions. ] 


CHAPTER XX 


ELEMENTARY DIVISORS AND THE EQUIVALENCE OF 
\-MATRICES 


91. \-Matrices and their Elementary Transformations. The theory 
of elementary divisors, invented by Sylvester, H. J. S. Smith, and, 
more particularly, Weierstrass, and perfected in important respects 
by Kronecker, Frobenius, and others, has, in the form in which we 
will present it,* for its immediate purpose the study of matrices 
(which without loss of generality we assume to be square) whose 
elements are polynomials in a single variable >». Such matrices we 
will callA-matrices.| The determinant of a A-matrix is a polynomial 
in X, and if this determinant vanishes identically, we will call the 
matrix a singular A-matrix. By the rank of a A-matrix we under- 
stand the order of the largest determinant of the matrix which is 
not identically zero. 

We have occasion here, as in § 19, to consider certain elementary 
transformations which we define as follows:. 


DEFINITION 1. By an elementary transformation of a -matrix 
we understand a transformation of any one of the following forms : 
(a) The interchange of two rows or of two columns. 


(6) The multiplication of each element of a row (or of a column) by 
the same constant not zero. 


(c) The addition to the elements of a row (or column) of the products 


of the corresponding elements of another row (or column) by one and 
the same polynomial in 2X. 


* Various modifications of the point of view here adopted are possible and im- 
portant. First, we may consider matrices whose elements are polynomials in any num- 
ber of variables. Secondly, we may confine ourselves to polynomials whose coefficients 
lie in a certain domain of rationality. Thirdly, we may approach the subject from the 
side of the theory of numbers, assuming that the coefficients of the polynomials are 
integers. The simplest case here would be that in which the elements of the matrix 
are themselves integers ; see Exercise 2, §91, Exercise 3, §92, and Exercise 2, § 94. 

{ The matrix of a pencil of quadratic formsis an important example of a \-matrix 
to which the general theory will be applied in Chapter XXII. 
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If we pass from a first matrix to a second by an elementary trans- 
formation, it is clear that we can pass back from the second to the 
first by an elementary transformation. Thus the following defini- 
tion is justified: 


DEFINITION 2. Two d-matrices are said to be equivalent if it is 
possible to pass from one to the other by means of a finite number of 
elementary transformations. 


We see here that all A-matrices equivalent to a given matrix are 
equivalent to each other; and, as in §19, that two equivalent A-ma- 
trices always have the same rank. 

The rank of a \-matrix is not, however, the only thing which is 

‘left unchanged by every elementary transformation. In order to 
show this we begin with 


Lemma 1. Jf the polynomial $(r) is a factor of all the t-rowed 
determinants of a X-matrix a, it will be a factor of all the t-rowed 
determinants of every X-matriz obtained from a by means of an elemen- 
tary transformation. 


If the transformation. is of the type (a) or (6) of Definition 1, this 
lemma is obviously true, since these transformations have no effect 
on the i-rowed determinants of a except to multiply them by con- 
stants which are not zero. If it is of the type (c), let us suppose it 
consists in adding to the elements of the pth column of a the corre- 
sponding elements of the gth column, each multiplied by the poly- 
nomial (A). Any ¢-rowed determinant of a which either does not 
involve the pth column, or involves both the pth and the qth, will 
be unaffected by this transformation. An «rowed determinant 
which involves the pth column but not the gth may be written after 
the transformation in the form A + W()B, where A and B are t-rowed 
determinants of a; so that here also our lemma is true. 


THEOREM 1. Jf a and b are equivalent d-matrices of rank r, and 
DX) is the greatest common divisor of the t-rowed determinants (i <r) 
of a, then it is also the greatest common divisor of the i-rowed determt- 
nants of b. 


For by our lemma, DX) is a factor of all the z-rowed determi- 
nants of b; and if these determinants had a common factor of higher 
degree, this factor would, by our lemma, be a factor of all the 
-rowed determinants of a; which is contrary to hypothesis. 
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The theorem just proved shows that the greatest common divisors 
D,(X), «-- D,(A) are invariants with regard to elementary transfor- 
mations, or, more generally, that they are invariants with regard to 
all transformations which can be built up from a finite number of 
elementary transformations. In point of fact they form, along with 
the rank 7, a complete system of invariants. To prove this we now 
proceed to show how, by means of elementary transformations, a 
d-matrix may be reduced to a very simple normal form. 


Lemma 2. Jf the first element * f(r) of ad-matrix is not identically 
zero and is not a factor of all the other elements, then an equivalent 
matrix can be formed whose first element is not identically zero and ts of 
lower degree than f. 

Suppose first there is an element f,(X) in the first row which is 
not divisible by f(A) and let 7 denote the number of the column in 
which it lies. Dividing f, by f and calling the quotient g and the 


remainder 7, we have Fi) = da)FO) + 7A). 


Accordingly, if to the elements of the jth column we add those of the 
first, each multiplied by —q(\), we get an equivalent matrix in which the 
first element of the jth column is r(X), which is a polynomial of degree 
lower than f(A). If now we interchange the first and yth columns, 
the truth of our lemma is established in the case we are considering. 

A similar proof obviously applies if there is an element in the 
first column which is not divisible by f(\). 

Finally, suppose every element of the first row and column is 
divisible by f(A), but that there is an element, say in the 7th row and 
jth column, which is not divisible by f(A). Let us suppose the ele- 
ment in the first row and jth column is ¥(A)f(\), and form an 
equivalent matrix by adding to the elements of the jth column 
—() times the corresponding elements of the first column. In 
this matrix, f(A) still stands in the upper left-hand corner, the first 
element of the jth column is zero; the first element of the 7th row 
has not been changed and is therefore divisible by f(A); and the 
element in the zth row and jth column is still not divisible by f. 
Now form another equivalent matrix by adding to the elements of 
the first column the corresponding elements of thejyth column. The 
upper left-hand element is still f(A), while the first element of the 


* By the first element of a matrix we will understand the element in the upper left- 
hand corner. 
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ith row is not divisible by f(\). This matrix, therefore, comes under 
the case already treated in which there is an element in the first 
column which is not divisible by f(A), and our lemma is established. 


Lemma 3. Jf we have a r-matrix whose elements are not all iden- 
tically zero, an equivalent matrix can be formed which has the following 
three properties : 

(a) The first element f(A) is not identically zero. 

(6) All the other elements of the first row and of the first column are 
identically zero. 

(c) Hvery element neither in the first row nor in the first column is 

divisible by f(r). 
' For we may first, by an interchange of rows and of columns, 
bring into the first place an element which is not identically zero. 
If this is not a factor of all the other elements, we can, by Lemma 2, 
find an equivalent matrix whose first element is of lower degree and 
is not identically zero. If this element is not a factor of all the 
others, we may repeat the process. Since at each step we lower the 
degree of the first element, there must, after a finite number of steps, 
come a point where the process stops, that is, where the first element 
is a factor of all the others. We can then, by using transformations of 
type (c¢) (Definition 1), reduce all the elements in the first row and in 
the first column except this first one to zero, while the other elements 
remain divisible by the first one. Thus our lemma is established. 

Finally, we note that since f(X) in the lemma just proved is a 
factor of all the other elements of the simplified matrix, it must, by 
Theorem 1, be the greatest common divisor of all the elements of 
the original matrix. 

The lemma just proved tells us that the \-matrix of the nth order 
of rank r>0 


(1) 


a4 ae An, 


Any *** Un 


can be reduced by means of elementary transformations to the form 


fx) 0 aes 0 
Or ind 


(2) ete | 


0 Dnt, 1 Sipe Ot aet 
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where f,(A) #9 and where f,(X) is a factor of all the 6’s. The last 
written matrix being necessarily of rank 7, the matrix of the (n—1)th 
order bi 


(3) 


pes by, n-1 


bn -1, ak ees b,-1, n— 


is of rank r—1. Consequently, if r>1, (8) may be reduced by 
means of elementary transformations to the form 


FAN) Oe oom Os 
0 


Cy, ft CL nag 


(4) Sk Ne ears a oa 


Rae cca erie a 


where f,(X) #0 and where f,(X) is a factor of allthe e’s. By Theorem 1, 
F(A), being the greatest common divisor of all the elements of (4), is also the 
greatest common divisor of all the 6’s, and is therefore divisible by f,(\). 

Now it is important to notice that the elementary transformations 
which carry over (8) into (4) may be regarded as elementary transforma- 
tions of (2) which leave the first row and column of this matrix unchanged. 
Thus by a succession of elementary transformations, we have reduced 


(1) to the form Fr) 0 (enero 
Offa) 0 he 9 
Omg 0 


Oy OS emg 


(5) 





0 OD C294 lang, ng 


where neither f, nor f, vanishes identically, f, is a factor of f,, and 
J, is a factor of all the e’s. 

If r > 2, we may treat the (m — 2)-rowed matrix of the e’s, which 
is clearly of rank rv — 2, in a similar manner. Proceeding in this 
way, we finally reduce our matrix (1) to the form 
SF (cpu ieee See Oar 

0 BON) os) 20m 0e eed 

Gita) Gece t 0 arte 
(6) One a noe lee UeeenO 
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where none of the f’s is identically zero, and each is a factor of the 
next following one. 

So far we have used merely elementary transformations of the 
forms (a) and (ce), Definition 1. By means of transformations of the 
form (0) we can simplify (6) still further by reducing the coefficient 
of the highest power of \ in each of the polynomials f(A) to unity. 
We have thus proved the theorem: 


THEOREM 2. very \-matrix of the nth order and of rank r can 
be reduced by elementary transformations to the normal form 


wipe Oe Or Ot 
QO FA) - Oe Ota a0) 

(7) 0 O + BA) 0 + 0 |p 
0 Oe eee, 200, WOke.o 0) 








where the coefficient of the highest power of X in each of the polynomials 
E(X) ts unity, and EX) is a factor of E;,,(r) for t= 1, 2, --- r—1. 
By Theorem 1, the greatest common divisor of the t-rowed determi- 
nants (<r) of the original matrix is the same as the greatest common 
divisor of the 7-rowed determinants of the normal form (7) to which it is 
reduced. These last mentioned 7-rowed determinants are, however, all 
identically zero except those which are the product of 2 of the H’s. Let 


(8) Ei) Bj,.(X) +++ Ly) 

be any one of these, and suppose the integers k,, k,, --- k; to have 
been arranged in order of increasing magnitude. We obviously 
have k, 21,k,22,---k; 27%. Consequently F, is a factor of #,, H, 
of #,,, etc. Thus B\) EA) «> EQ) 

is seen to be a factor of (8), and, being itself one of the 7-rowed de- 
terminants of (7), it is their greatest common divisor. That is, 


THEOREM 3. The greatest common divisor of the i-rowed determt- 
nants of a r-matrix of rank r, when t <1, ts 
DX) = BO)BA) + LO), 
where the E’s are the elements of the normal form (7) to which the given 
matrix is equivalent. 
It may be noticed that this greatest common divisor is so deter- 
mined that the coefficient of the highest power of X in it is unity. 
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We come now to the fundamental theorem: 

THEOREM 4. A necessary and sufficient condition for the equiva- 
lence of two r-matrices of the nth order is that they have the same rank 
r, and that for every value of i from 1 to r inclusive, the i-rowed deter- 
minants of one matrix have the same greatest common divisor as the 
i-rowed determinants of the other. 

To say that this is a necessary condition is merely to restate 
Theorem 1. To prove it sufficient, suppose both matrices to be 
reduced to the normal form (7), where we will distinguish the 
normal form for the second matrix by attaching accents to the 
E’s init. If the conditions of our theorem are fulfilled, we have, by 


Theorem 3, EX) = E,(A), 
LO) BAA) = BOLO), 
BA) BA) BX) = BO) EO) EX), 


and, since none of these #’s are identically zero, it follows that 
H(A) = EXX) (6s A, 2) ory. 
Thus the normal forms to which the two »-matrices can be reduced 


are identical, and hence the matrices are equivalent, since two 
A-matrices equivalent to a third are equivalent to each other. 


EXERCISES 
1. Reduce the matrix 
ry ee 0 0 
OAS OO 0 
OO ae) 0 
OOM Ome Neat 0 
OPO RO Ole yt 


by means of elementary transformations to the normal form of Theorem 2. 
Verity the result by finding the greatest common divisors D,(X) first directly, 
and secondly from the normal form. 


2. By an elementary transformation of a matrix all of whose elements are 
integers is understood a transformation of any one of the following forms: 

(a) The interchange of two rows or of two columns. 

(6) The change of sign of all the elements of any row or column. 

(c) The addition to the elements of one row (or column) of the products of the 
corresponding elements of another row (or column) by one and the same integer. 

Starting from this definition, develop the theory of matrices whose elements are 
integers along the same lines as the theory of A-matrices was developed in this section. 
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92. Invariant Factors and Elementary Divisors. In place of the 
invariants DX) of the last section, it is, for most purposes, more 
convenient to introduee certain other invariants to which we will 
give the technical name invariant factors. As a basis for the defini- 
tion of these invariants we state the following theorem, which is 
merely an immediate consequence of Theorem 38, § 91: 


THEOREM 1. The greatest common divisor of the i-rowed determi- 
nants (t= 2, 3, --- r) of a X-matrix of rank r is divisible by the greatest 
_ common divisor of the (i—1)-rowed determinants of this matrix. 

DEFINITION 1. Jf a ts a X-matrix of rank r, and 

Dr) (= 12ers) 
the greatest common divisor of its i-rowed determinants so determined 
that the coefficient of the highest power of d is unity; and if D(r)=1; 
then the polynomial 

Dr) 


(1) BO)=5 ay 


is called the ith invariant factor of a. 





(i=1, 2, 7) 


This definition shows that these H’s are really invariants since 
they are completely determined by the D’s which we proved to be 
invariants in § 91. Moreover, by multiplying together the first 2 of 
the relations (1), we get the formula 


(2) DQ) = BQ) BA) BQ) @=1,2, + 2). 


This shows us that the H’s completely determine the D’s, and since 
these latter were seen in § 91 to form, together with the rank, a 
complete system of invariants, the same is true of the #’s. — That is, 


THEOREM 2. A necessary and sufficient condition that two r-ma- 
trices be equivalent is that they have the same rank r, and that the inva- 
riant factors of one be identical respectively with the corresponding 
invariant factors of the other. 

Since, in the case of a non-singular matrix of the nth order, 
D,(x) differs from the determinant of the matrix only by a constant 
factor, we see that in this case the determinant of the matrix is, 
except for a constant factor, precisely the product of all the invari- 
ant factors. This is the case which is of by far the greatest impor- 
tance, and the term invariant factor comes from the fact that the #’s 
are really factors of the determinant of the matrix in this case. 
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A reference to Theorem 3, § 91, shows that our invariant factors 
are precisely the polynomials HZ, which occur in the normal form of 
Theorem 2, § 91; and, since in that normal form each # is a factor 
of the next following one, we have the important result, 


TuroreM 3. If H,(), «+ EQ) are the successive invariant fac- 
tors of a r-matrix of rank r, then each of these E’s is a factor of the next 
following one. 


This theorem enables us to arrange the invariant factors of a \-ma- 
trix in the proper order by simply arranging them in the order of 
increasing degree, two H’s of the same degree being necessarily 
identical. 

The invariant factors (like the D’s of the last section) may be 
spoken of as rational invariants of our )-matrix since they are formed 
from the elements of the \-matrix by purely rational processes, 
namely the elementary transformations of § 91, which involve only 
the rational operations of addition, subtraction, multiplication, and 
divison. In distinction to these the elementary divisors, first intro- 
duced by Weierstrass, are, in general, irrational invariants.* These 
we now proceed to define. 


DEFINITION 2. Jf a ts a d-matrix of rank r, and D,(r) ts the 
greatest common divisor of the r-rowed determinants of a, then the linear 
piers each eA 
of D,(r) are called the linear factors of a.t 


Since, by formula (2), D,(\) is the product of all the invariant 
factors of a, it is clear that each invariant factor is merely the prod- 
uct of certain integral powers, positive or zero, of the linear factors 
of a. We may therefore lay down the following definition : 


* German writers, following Frobenius, use the term elementary divisor to cover 
both kinds of invariants. This is somewhat confusing, and necessitates the use of 
modifying adjectives such as simple elementary divisors for the elementary divisors 
as originally defined by Weierstrass, composite elementary divisors for the E’s. On 
the other hand Bromwich (Quadratic Forms and their Classification by Means of 
Invariant-factors, Cambridge, England, 1906) proposes to substitute the term tnvari- 
ant factor for the term elementary divisor. Inasmuch as this latter term is wholly 
appropriate, it seems clear that it should be retained in English as well as in German 
in the sense in which Weierstrass first used it. 

+ It will be noticed that if a is non-singular, the linear factors of a are simply the 
linear factors of the determinant of a. 
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DEFINITION 3. Let a be a d-matrix of rank r, and 


A—«a, A—a’, A—a!!, . 


ea 


ats distinct linear factors. Then if 


E{r)=(r— «(A — 0! H(A — a! « G@=1, 2, + 7), 
are the invariant factors of a, such of the factors 
(A—«@)*1, ’ (Aa), eee (X—«@)*r, 
(A— a! )%, (A—al yt, seeeee (n— 2b, 
(rX a al | ee (Ar P< a! i! ye, Rabi oa (X nd ot!" Ve", 


as are not mere constants are called the elementary divisors of a, each 
elementary divisor being said to correspond to the linear factor of which 
it is a power.* 

Since the invariant factors completely determine the elementary 
divisors and vice versa, it is clear that the elementary divisors are not 
merely invariants, but that, together with the rank, they form a 
complete system of invariants. That is, 


THEOREM 4. A necessary and sufficient condition that two \-ma- 
trices be equivalent is that they have the same rank and that the elemen- ~ 
tary divisors of one be identical respectively with the corresponding 
elementary divisors of the other. 


By means of Theorem 3 we infer the important result : 


THEOREM 5. The degrees e, of the elementary divisors correspond- 
ing to any particular linear factor satisfy the inequalities 
Cpe Cay (2,8, --- 1). 
By means of this theorem we can arrange the elementary divisors 
corresponding to any given linear factor in the proper order by simply 
noticing their degrees. 


* Tt will be seen that the definition just given is equivalent to the following one, in 
which the conception of invariant factors is not introduced : 

Derinition. Let \—& be a linear factor of the \-matrix a of rank r, and let li be 
the exponent of the highest power of \—a which is a factor of all the t-rowed determi- 
nants (iSr) ofa. If the integers e; (which are necessarily positive or zero) are defined 
by the formula =i —Li-1 (i=1, 2, --- 7), 
then such of the expressions C0), O—8)%, «- (0—0)* 


as are not constants are called the elementary divisors of a which correspond to the 
linear factor \—«. 
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EXERCISES 


1. If 6=0 and w=0 are two conics of which the second is non-singular, 
show how the number and kind of singular conics contained in the pencil 
 — Ay = 0 depends on the nature of the elementary divisors of the matrix of the 
quadratic form ¢ — AY. 

2. Extend Exercise 1 to the case of three dimensions. 

3. Apply the considerations of this section to matrices whose elements are 
integers. (Cf. Exercise 2, § 91). 


93. The Practical Determination of Invariant Factors and Elemen- 
tary Divisors. ‘The easiest general method for determining the 
invariant factors of a particular \-matrix is to reduce it by means of 
elementary transformations to the normal form of Theorem 2, § 91, 
following out step by step the reduction used in the proof of that theo- 
rem. From this normal form the invariant factors may be read off ; 
and from these the elementary divisors may be computed, although only, 
in general, by the solution of equations of more or less high degree. 

There are, however, many cases of great importance in which the 
elementary divisors may more easily be obtained by other methods. 
The most obvious of these is to apply the definition of elementary 
divisors directly to the case in hand. As an illustration, we mentiona 
matrix of the nth order which has «—X as the element in each place 
of the principal diagonal, while all the other elements are zero except 
those which lie immediately to the right of or above the elements of 
the principal diagonal, these being all constants different from zero: 


ee chen er ee aRD 
ee ay ro ay ah 

ee MM oe Ge 
0 0 0 sO AO oy 
0 0 99 Osc, 


The determinant of this matrix is (e—)”. The determinant 
obtained by striking out the first column and the last row is 
C1C_+*: Cy». Accordingly 

D,(rA)=(A— «)", Dr, yA)=1, £00) =(’— «)”. 


Thus we see that (A—«)” is the only elementary divisor of this 
matrix, while the invariant factors are (A — «)” and n—1 1’s. 


ELEMENTARY DIVISORS AND A-MATRICES 273 


This direct method may sometimes be employed to advantage in 
conjunction with the method of reduction by elementary transfor- 
mations. Cf. Exercise 1 at the end of this section. 

A further means of recognizing the elementary divisors in some 
special cases is furnished by the following theorems whose proofs, 
which present no difficulty, we leave to the reader: 


THEOREM 1. /Jf all the elements of a d-matrix are zeros except 
those in the principal diagonal, and if each element of this diagonal 
which is not a constant is resolved into the product of a constant by powers 
of distinct linear factors of the form \ — a, X—a!, ---, then these powers 
of linear factors will be precisely the elementary divisors of the matrix. 

THEOREM 2. Jf all the elements of a r-matrix are zeros except 
those which lie in a certain number of non-overlapping principal minors, 
then the elementary divisors of the matrix may be found by taking the 
elementary divisors of all these principal minors. 

The proof of this theorem consists in reducing the given matrix 
to the form referred to in Theorem 1 by means of elementary trans- 
formations each of which may be regarded as an elementary trans- 
formation of one of the principal minors in question. 

It should be noticed that this theorem would not be true if the 
words invariant factors were substituted in it for elementary divisors ; 
cf. Exercise 3 below. The invariant factors may, however, be com- 
puted from the elementary divisors when these have been found. 





EXERCISES 
1. Prove that the matrix 
y Meee al) Oe tet 0 0 
0° A=e 0 1 0 —1 0 
0 OA 0 eee 
B? 1 0 !A-a@ 0 0 
0 B? te Ome a -<0 
0 0 [i 0 A-« 
is equivalent to 
-1 0 0; 0 0 0 
Cet) oO 0 0 0 
0 oO -1) 0 0 0 
(6 0 A —4)!+ p 1 0 
0 oO oO1 0 (A — a)? + B? 1 
Oat 00 os 0 0 (A — a)? + B? 
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and hence that its elementary divisors are 
[A — (@ + fi)]*, [A— (@ — fi)]*. 
2. Generalize Exercise 1 to matrices of order 2n. 
3. Find (a) the elementary divisors, and (b) the invariant factors of the 


matrix AA — 1)? 0 0 0 | 
0 ACA — 1)8 0 0 
0 0 A-1 0 
0 0 0 r 
4. Determine the invariant factors and the elementary divisors of the matrix 
2X 3 0 1 r 
4X 3(A + 2) 0 A+2 22 
0 6X r Pw XY 
A-1 0 A-1 0 0 


3(A — 1) 1-A 2A-1) 0 0 
Ts this matrix equivalent to the matrix in the exercise at the end of § 91? 


5. Devise a convenient rational process for computing the invariant factors of 
matrices of the kinds considered in Theorems 1 and 2. 


94. A Second Definition of the Equivalence of \-Matrices. The 
definition of equivalence of \-matrices which we have used so far 
rests on the elementary transformations. These transformations are 
of such a special character that this definition is not convenient for 
most purposes. We now give a new definition which we will prove 
to be coextensive with the old one. 


DEFINITION. Two n-rowed \-matrices aand b are said to be equiv- 
alent if there exist two non-singular n-rowed X-matrices c and d whose 
determinants are independent of r, and such that 
GQ) b = cad.* 

Since the matrices c and d have, by hypothesis, constant determi- 
nants, the inverse matrices c~! and d will also be \-matrices, and not 
matrices whose coefficients are fractional rational functions of ) as would 
in general be the case for the inverse of A-matrices. Consequently, if 
we write (1) in the form 
(2) a= co bd, 
we see that the relation established by our definition between the 
matrices a and b is a reciprocal one, as is implied in the wording of 
the definition. 


* We use here and in what follows the sign = between two \-matrices to denote 
that every element of one matrix is identically equal to the corresponding element of 
the other. 
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In order to justify the definition just given, we begin by estab- 
lishing the 


LemMA. If aand bare n-rowed X-matrices, and the polynomial 
$(X) is a factor of all the i-rowed determinants of a, it is a factor of all 
the t-rowed determinants of ab and also of ba. 


For, by Theorem 5, § 25, every t-rowed determinant of ab and 
also of ba isa homogeneous linear combination of certain 7-rowed 
determinants of a. 


THEOREM 1. Jf aand b are equivalent according to the definition of 
this section, they are also equivalent according to the definition of § 91. 


For in this case there exist two non-singular \-matrices, ¢ and 
d, whose determinants are constants, such that relation (1) holds. 
Consequently, by Theorem 7, § 25,* a and b have the same rank r. 
Let DX) be the greatest common divisor of the t-rowed determi- 
nants of a, where ?<r. By our lemma, D,)) is a factor of all 
the z7-rowed determinants of ca, and therefore, applying the lemma 
again, it is a factor of all the z-rowed determinants of cad, that is, of db. 

We can infer further that D,) is the greatest common divisor 
of the 7-rowed determinants of b. For applying to relation (2) the 
reasoning just used, we see that the greatest common divisor of the 
z-rowed determinants of b is a factor of all the ¢-rowed determi- 
nants of a, and cannot therefore be of higher degree than Dn). 

A reference to Theorem 4, § 91, now shows us that a and b are 
equivalent according to the definition of that section. 


THEOREM 2. Jfaand bd are equivalent according to the definition 
of § 91, they are also equivalent according to the definition of the pres- 
ent section. 

We begin by showing that if we can pass from a matrix a to a 
matrix a, by means of an elementary transformation, one of the fol- 
lowing relations always holds: 

(3) aj=ca or a,=ad 

where c and d are non-singular matrices whose determinants are 
independent of ». To prove this we consider in succession the 
elementary transformations of the forms which were called (a), (6), 
(c), in Definition 1, § 91. 


* How is it that we have a right to apply this theorem to \-matrices ? 
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(a) Suppose we interchange the pth and gth rows. This can 
be effected by forming the product ca where the matrix c may be 
obtained by interchanging the pth and gth rows (or columns) in the 
unit matrix ay 


1 
Det est 80) 


O10) 20 Moan 
Similarly the interchange of the pth and qth columns of a may be 
effected by forming the product ac, where c has the same meaning 
as before. 

In each of these cases,c may be regarded as a non-singular 
A-matrix with constant determinant, since its elements are constants 
and its determinant is — 1. 

(6) To multiply the pth row of a by a constant k, we may form 
the product ca, where c differs from the unit matrix only in having 
k instead of 1 as the pth element of the principal diagonal. 

Similarly, we multiply the pth column of a by &, by forming the 
product ac, where c has the same meaning as before. 

If we take the constant # different from zero, c may be regarded - 
as a non-singular A-matrix with constant determinant. 

(c) We can add to the pth row of a (A) times the gth row by 
forming the product ca, where c differs from the unit matrix only in 
having ¢(A) instead of zero as the element in the pth row and qth 
column. 

Similarly we add to the gth column ¢(\) times the pth column 
by forming the product ac where c has the same meaning as before. 

The matrix c, whose determinant is 1, is a non-singular 
A-matrix. 

It being thus established that one of the relations (8) holds 
between any two A-matrices which can be obtained from one another 
by an elementary transformation, it follows that two matrices a 
and b which are equivalent according to the definition of § 91 will 
satisfy a relation of the form 


b = Cy Cpy_1 *°* cyad,d, Pa d, 


where each of the c’s and d’s is a non-singular A-matrix of constant 
determinant which corresponds to one of the elementary transforma- 
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tions we use in passing from a to b. This last relation being of the 
form . 
A b=cad, 


where ¢ and d are non-singular \-matrices with constant determi- 
nants, our theorem is proved. 

We have now completed the proof that our two definitions of 
the equivalence of A-matrices are coextensive. 


EXERCISES 


1. If a denotes the matrix in Exercise 1, § 91, and b the normal form of 
Theorem 2, § 91, for this matrix, determine two A-matrices, c and d, such that 
relation (1) holds. 

Verify your result by showing that the determinants of c and d are constants. 


2. Apply the considerations of this section to matrices whose elements are 
integers. Cf. Exercise 2, § 91, and Exercise 3, § 92. 


95. Multiplication and Division of \-Matrices. We close this 
chapter by giving a few developments of what might be called the 
elementary algebra of \-matrices. 


DEFINITION. By the degree of a r-matrix is understood the high- 
est degree in X of any one of its elements. 

For a A-matrix of the /th degree, the element in the 7th row andyth 
column may be written ay rt + alrtl 4 «+ al, 
and at least one of the coefficients of X* (7.e. one of the a,’s) must 
be different from zero. If, then, we denote by a, the matrix of 
which al?! is the element which stands in the 7th row and jth col- 
umn, we get the theorem 

THEOREM 1. Every \.matriz of the kth degree may be written in the form 
(1) fy MF + ayAF 1 «+ ay (a) #0) 
where ay +--a, are matrices with constant elements ; and conversely, 
every expression (1) is a r-matrix of degree k. 


THEOREM 2. The product of two r-matrices of degrees k and 1 
ayr* + ayAP 1+ oe + ay (ay #0) 
bod’ + DAY 1 ++ + dD, (by #0) 


is a r-matrix of degree k +1 provided at least one of the matrices a, and 
by ts non-singular. 
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For this product is a \-matrix of the form 
Coe a Coe Se Cyst 


where c, has the value agby or bya) according to the order in which 
the two given matrices are multiplied together. By Theorem 7, 
§ 25, neither ayb) nor bya) is zero if ay and by are not both singular. 

The next theorem relates to what we may call the division 
of A-matrices. 


THeoreM 8. Jf a and b are two r-matrices and if b, when written 
in the form (1), has as the coefficient of the highest power of X a non- 
singular matrix, then there exists one, and only one, pair of d-matrices 
q, and r, for which a=q,b+1, 
and such that either r,=0, or r, ts a A-matrix of lower degree than b; 
and also one and only one pair of X-matrices q, and ry for which 


a=bq.,+T, 
and such that either r,=0, or r, 7s a A-matrix of lower degree than b. 


The proof of this theorem is practically identical with the proof 
of Theorem 1, § 68. 


EXERCISE 


Derinition. By a real matrix is understood a matrix whose elements are real ; 
by a real X-matriz, a matrix whose elements are real polynomials in X; and by a real 
elementary transformation, an elementary transformation in which the constant in (b) 
and the polynomial in (ce), Definition 1, § 91, are real. 


Show that all the results of this chapter still hold if we interpret the words 
matrix, X-matrix, and elementary transformation to mean real matrix, real d-matriz, 
and real elementary transformation, respectively. 


CHAPTER XXI 


THE EQUIVALENCE AND CLASSIFICATION OF PAIRS OF 
BILINEAR FORMS AND OF COLLINEATIONS 


96. The Equivalence of Pairs of Matrices. The applications of 
the theory of elementary divisors with which we shall be concerned 
in this chapter and the next have reference to problems: in which 
A-matrices occur only indirectly. A typical problem is the theory 
of a pair of bilinear forms. The matrices aand b of these two forms 
have constant elements, and we get our A-matrix only by consider- 
ing the matrix a—Ab of the pencil of forms determined by the two 
given forms. It will be noticed that this matrix is of the first 
degree, and in fact we shall deal, from now on, exclusively with 
A-matrices of the first degree. 

By the side of this simplification, a new difficulty is introduced, 
as will be clear from the following considerations. We shall subject 
the two sets of variables in the bilinear forms to two non-singular 
linear transformations whose coefficients we naturally assume to be 
constants, that is, independent of >. These transformations have the 
effect of multiplying the A-matrix, a— Ab, by certain non-singular 
matrices whose elements are constants (cf. § 36) and therefore, by 
§ 94, carry it over into an equivalent \-matrix which is evidently of 
the first degree. The transformations of § 94, however, were far 
more general than those just referred to, so that it is not at all ob- 
vious whether every A-matrix of the first degree equivalent to the 
given one can be obtained by transformations of the sort just re- 
ferred to or not. 

These considerations show the importance of the following 
theorem : 


THEOREM 1. Jf aj, ag, Dy, b, are matrices with constant elements 
of which the last two are non-singular, and if the’ d-matrices of the first 
degree m, =a, — Ab, m, =a, — Ab, 
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are equivalent, then there exist two non-singular matrices, p and q, 
whose elements are independent of r, and such that 


(1) m, = pm,q. 


Since m, and m, are equivalent, there exist two non-singular 
A-matrices, py and qo, whose determinants are constants and such that 


(2) ; M, = PyM, qo: 


The matrix q, has, therefore, an inverse, qj1, which is also a 
A-matrix. 

Let us now divide p, by m, and qj)! by m, by means of Theorem 
3, § 95, in such a way as to get matrices p,, p, s,s which satisfy the 
relations 
(3) Po = MyP; + P» qo'=S;m, +s, 


p and s being matrices whose elements are independent of >» From 
2) we get = 
eee ppm, =m,q5l. 


Substituting here from (3), we have 


M,P,M, + pm, = M_s,M, + MS, 
or 
(4) m,(p; — $,)m, = m,s — pm. 


From this identity we may infer that p, =s, and therefore 
(5) m,s =pm. 


For if p,—s, were not identically zero, m,(p,—s,) would be a 
A-matrix of at least the first degree (cf. Theorem 2, $95), and hence 
the left-hand side of (4) would be a \-matrix of at least the second 
degree. But this is impossible, since the right-hand side of (4) 
is a A-matrix of at most the first degree. 

If we knew that p and s were both non-singular, our theorem 
would follow at once from (5); for we could write (5) in the form 


(6) m, = pm,s"! 


and pand s~! would be non-singular matrices with constant elements. 
Moreover, we see from (5) that p and s are either both singular or 
both non-singular. Our theorem will thus be proved if we can 
show that s is non-singular. 
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For this purpose let us substitute in the identity 
’ 1=4q,95! 
for qo its value from (3), 
(7) IT=q,s;m, + qs. 
Now divide q) by m, by means of Theorem 8, § 95, in such a way as 
to get 
(8) — do = 4m, + 4 
where q is a matrix with constant elements. 
Substituting this value in (7), we have 


I=q,)s,m, + q,m,s + qs. 


Referring to (5), we see that this may be written 
(9) I— qs = (qs; + ayP)m). 


From this we infer that q)s,-+q,p must be identically zero, and 


therefore 
(10) I=gQs. 


For if qos; -+4q,p were not identically zero, the right-hand side of 
(9) would be a X-matrix of at least the first degree, while the left- 
hand side of (9) does not involve 2. 

Equation (10) shows that sis non-singular, and thus our theorem 
is proved. It shows us, however, also that q is non-singular, and 
that q=s71, so that equation (6) becomes m, = pm,q. 

We may, therefore, add the following 


Corou“uARyY. The matrices p and q whose existence is stated in the 
above theorem may be obtained as the remainders in the division of po 
and qy in (2) by m, by means of the formule : 

Po = ™,P, + P, Qo = 41M, + 4. 


From this theorem concerning A-matrices of the first degree we 
can now deduce the following theorem concerning pairs of matrices 
with constant elements. It is this theorem which forms the main 
foundation for such applications of the theory of elementary divisors 
as we shall give. 

We shall naturally speak of two pairs of matrices with constant 
elements a,, b, and a,, b, as equivalent if two non-singular matrices p 
and q exist for which 

(11) a, = pa,q, b, = pb,q. 
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TurorEM 2. Jf a, b, and ay, by are two pairs of matrices 
independent of Xr, and if b, and by are non-singular, a necessary 
and sufficient condition that these two pairs of matrices be equiva- 
lent is that the two d-matrices 


have the same invariant factors, — or, if we prefer, the same elementary 
divisors. 


For if-the pairs of matrices are equivalent, equations (11) 
hold; hence, multiplying the second of these equations by » 
and subtracting it from the first, we have 


(12) . m, =pm,q, 


that is the \-matrices m, and m, are equivalent, and therefore have 
the same invariant factors, and the same elementary divisors. On 
the other hand, it follows at once from the assumption that b, and b, 
are non-singular, that m, and m, are non-singular, and hence have 
the same rank. Consequently if m, and m, have the same invariant 
factors, or the same elementary divisors, they are equivalent. Since 
they are of the first degree, there must, by Theorem 1, exist two 
non-singular matrices p and q, whose elements are independent of A, 
which satisfy the identity (12). From this identity, the two equa- 
tions (11) follow at once ; and the two pairs of matrices are equivalent. 
Thus the proof of our theorem is complete. 

A case of considerable importance is that in which the matrices 
b, and b, both reduce to the unit matrix I. In this case m, and m, 
reduce to what are known as the characteristic matrices of a, and a, 
respectively, according to the following definition : 


DEFINITION. Jf a ts a matrix of the nth order with constant ele- 
ments and I the unit matrix of the nth order, the \-matrix 


A=a-~l 


is called the characteristic matrix of a; the determinant of A is called 
the characteristic function of a; and the equation of the nth degree in 
r formed by setting this determinant equal to zero is called the char- 
acteristic equation of a. 
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We can now deduce from Theorem 2 the following more special 
result : 

THEOREM 8. Jf aj and a, are two matrices independent of d, a neces- 
sary and sufficient condition that a non-singular matrix p exist such that * 


(13) a, = pap! 


is that the characteristic matrices A, and A, of a, and a, have the same 
invariant factors, — or, if we prefer, the same elementary divisors. 

For if A, and A, have the same invariant factors (or elementary 
divisors), there exist, by Theorem 2, two non-singular matrices p 
and q such that ay = pa,d, I= plq. 

The second of these equations shows us that q=p7!; and this 
value being substituted in the first, we see that p is the matrix whose 
existence our theorem asserts. 

That, on the other hand, A, and A, have the same invariant factors 
and elementary divisors if equation (13) is fulfilled, is at once obvious. 


97. The Equivalence of Pairs of Bilinear Forms. Suppose we 
have a pair of bilinear forms in 2 variables 


$i= My ViY p v= LOG UY 
and also a second pair 
Sal! S pl! 
$2 = LAG MY pp Yo = 2 Oy 2iYp 
and let us assume that yw, and y, are non-singular. We will in- 
quire under what conditions the two pairs of forms are equivalent, 
that is, under what conditions a first non-singular linear transforma- 
tion for the z’s and a second for the y’s, 
By = Cy 2 + + HOT Yx= Ay Yt +AinYn 
Si ee a Co ie aap ae 
a Y Bae = ! ew og al 
Ln = Cny Vy fe CrnXn Yn= 21414 + nn Yn 
can be found which together carry over ¢, into ¢, and yy, into po. 


* Two matrices connected by a relation of the form (13) are sometimes called 
similar matrices. This conception of similarity is evidently merely a special case of 
the general conception of equivalence as defined in § 29, the transformations considered 
being of the form (18) instead of the more general form usually considered in this 
- chapter and the last. 
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If we denote the conjugate of the matrix c by c’ and the matrices 
of $1, Ws bor Wy by ay, Dy, Ag, Dy respectively, we know, by Theorem 1, 
§ 36, that the transformations c,d carry over ¢, and y, into forms 
with matrices cla.d, c'b,d ;. 


respectively; so that, if these are the forms ¢, and y,, we have 
(1) a, = c/a,d, b, = c'b,d. 


Consequently, by Theorem 2, § 96, the two \-matrices 

ay a rb, a, 2 Ab, : 
have the same invariant factors and elementary divisors. 

Conversely, by the same theorem, if these two \-matrices have 
the same invariant factors (or elementary divisors), two constant 
matrices c’ and d exist which satisfy both equations (1); and hence 
there exists a linear transformation of the 2’s and another of the y’s 
which together carry over ¢, into ¢, and w, into y,. Thus we 
have proved the 


THEOREM. Jf ¢,,W, and ¢,, Wy are two pairs of bilinear forms in 2n 
variables of which Wr, and Wr, are non-singular, a necessary and sufficient 
condition that these two pairs of forms be equivalent is that the matrices 


of the two peneils Cre As by — hy 


have the same invariant factors,—or, if we prefer, the same elementary 
divisors.* 
EXERCISE 
Prove that the theorem of this section remains true if the bilinear forms 


1, 1, $2, We are real and the term equivalent is understood to mean equivalent with 
regard to real non-singular linear transformations. 


98. The Equivalence of Collineations. A second important appli- 
cation of the theory of elementary divisors is to the theory of col- 
lineations. For the sake of simplicity we will consider the case of 


two dimensions pies 
Ly = Ay, 2% + AyQ% + Ag 2g, 


/ 
A 1 Vy = Mp1 Ly + Aga %y + Ag Upp 
Uy = gy y + Agy®y + Ag Xyp 
although the reasoning will be seen to be perfectly general. 


* For the sake of brevity, we shall, in future, speak of these invariant factors and 
elementary divisors as the invariant factors and elementary divisors of the pairs of 
forms 1, ¥1 and go, We respectively. 
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We have so far regarded a collineation merely as a means of 
transforming certain geometric figures. It is possible to adopt an- 
other point of view, and to study the collineation in itself with 
special reference to the relative position of points before and after 
the transformation. Thus suppose we have a figure consisting of 
the points A,, A,, ---, finite or infinite in number, and suppose these 
points are carried over by the collineation a into the points Aj, 
Al, ---. These two sets of points together form a geometric figure. 
It is the properties of such figures as this that we call the properties 
of the collineation. Such properties may be either projective or 
metrical. Thus it would be a metrical property of a collineation if 
it carried over some particular pair of perpendicular lines into a pair 
of perpendicular lines; it would be a projective property of the 
collineation if it carried over some particular triangle into itself. We 
shall be concerned only with the projective properties of collineations. 

As an example, let us consider the fixed points of the collineation, 
that is points whose initial and final position is the same. In order 
that (x1, 2, 2,) be a fixed point it is necessary and sufficient that 


jin me ee ae 
L, = AX, Lg=AVy 83 = AL, 


that is, substituting in a, that a constant » exist such that, 


(44, —d)ey + O49 X_ + A3%3 = 0, 
(1) Ay %y + (gg — ®)ly + Ang%3 = 0, 
fyenes igo,%q + (gg — dg = 0. 


The matrix of this system of equations is precisely what we have 
called the characteristic matrix of the matrix a of the linear trans- 
formation. The characteristic function is a polynomial of the third 
degree in X which, when equated to zero, has one, two, or three dis- 
tinct roots. Let , be one of these roots. When this is substituted 
in (1), these equations are satisfied by the codrdinates of one or more 
points,—the fixed points of the collineation a. The number and 
distribution of these fixed points give an important example of a 
projective property of a collineation; and it is readily seen that 
collineations may have wholly different properties in this respect, 
one having three fixed points, another two, and still another an 
infinite number. 

Coming back now to the two sets of points A,, A,,--- and Aj, 
Aj, --- which correspond to one another by means of the collinea- 
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tion a (which may be singular or non-singular), let us subject all these 
points to a non-singular collineation c, which carries over Ay, Aj, --- 
into B,, B, --- and Aj, Aj, --- into Bi, Bj, --- respectively. The fig- 
ure formed by the B's will have the same projective properties as 
that formed by the A’s; and consequently if we can find a collinea- 
tion b which carries over B,, B,, --- into Bi, Bh, ---, this collineation 
will have the same projective properties as the collineation a. Such 
a collineation is clearly given by the formula 
(2) Decne 
since c~! carries over the points B; into the points A;, a then carries 
over these into A/, and c carries over the points A} into the points Bj. 
Since two collineations a and b related by formula (2) are indis- 
tinguishable so far as their projective properties go (though they 
may have very different metrical properties), we will call them 
equivalent according to the following 


DEFINITION. Two collineations a and b shall be called equivalent 
af a non-singular collineation c exists such that relation (2) is fulfilled. 


A reference to Theorem 3, § 96, now gives us the fundamental 
theorem: 


THEOREM. <A necessary and sufficient condition that two collinea- 
tions be equivalent 2s that their characteristic matrices have the same in- 
variant factors, — or, of we prefer, the same elementary divisors. 


EXERCISES 


1. If Py, Ps, ++» P;, are fixed points of a non-singular collineation in space of 
n — 1 dimensions which correspond to k distinct, roots of the characteristic equa- 
tion, prove that these points are linearly independent. 
2. Discuss the distribution of the fixed points of a collineation 
(a) in two dimensions, 
(6) in three dimensions, 
for all possible cases of non-singular collineations. 
8. Discuss the distribution of 
(a) the fixed lines of a collineation in two dimensions, 
(0) the fixed planes of a collineation in three dimensions, 
for all possible cases of non-singular collineations; paying special attention to their 
relation to the fixed points. 
4. Two real collineations, a and b, may be said to be equivalent if there exists 
a real non-singular collineation ¢ such that b= cac-1. 
With this understanding of the term equivalence, show that the theorem of the 
present section holds for real collineations. 


a 
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99. Classification of Pairs of Bilinear Forms. We consider again 
the pair of bilinear forms 


i. LAjLiYjp p= LWyxy; 
of which we assume: the second to be non-singular, and form the 


A-matrix. 


(1) a— Nb. 


Using a slightly different notation from that employed in § 92, we 
will denote the elementary divisors of (1) by 


(vr = ry) (r = Ne) Spee ae (rX = rn) (e, + ey ses. = nN), 


so that the linear factors ) — A, need not all be distinct from one 
another. The most important thing concerning these elementary 
divisors is, for many purposes, their degrees, e,, é, --- e, When we 
wish to indicate these degrees without writing out the elementary 
divisors in full, we will use the symbol [e, e, --- e,], called the char- 
acteristic of the \-matrix (1), or of the pair of forms ¢,y. It will be 
seen that this characteristic is a sort of arithmetical invariant of the 
pair of bilinear forms, since two pairs of bilinear forms which are 
equivalent necessarily have the same characteristic. The converse 
of this, however, is not true, since for the equivalence of two pairs of 
bilinear forms the identity of the elementary divisors themselves, 
not merely the equality of their degrees, is necessary. 

All pairs of bilinear forms which have the same characteristic are 
said to form a category. ‘Thus, for example, in the case of pairs of 
bilinear forms in six variables we should distinguish between three 
categories corresponding to the three characteristics, 


(iets i, (oer 1], [3], 


which are obviously the only possible ones in this case. Ith fact, we 
must inquire whether these three categories really all exist. This 
question we answer in the affirmative by writing down the following 
pairs of bilinear forms in six variables which represent these three 
categories : 


1 Sr a Sp ly AyPils + Aaa + eal 


LY, + Yn t+ WY yr 
tee. Oo 0 

Cie re at 

0 Oni wal teeny 
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Ny 1 + ALY 2 + MyYq + AgU3Y 35 


Il. [21] 





LyYy~ tr LYo + L3Yzp 
wes ele 0 
ew 0 Ay — a» 
Ill [3] Mey aN NyLoYo AN: MyX3Y5 ue L4Yo am LY 39 
DY, + eYgtr LY» 
(ey a6 


The pairs of bilinear forms we have just written down do more 
than merely establish the existence of our three categories. They 
establish the fact that not only the degrees of the elementary divisors 
are arbitrary (subject merely to the condition that their sum be 
three), but that, subject to this restriction, the elementary divisors 
themselves may be arbitrarily chosen. They are, moreover, normal 
forms to one or the other of which every pair of bilinear forms in 
six variables, of which the first is non-singular, may be reduced by 
non-singular linear transformations. 

The general theorem here is this: 


THEOREM. Jf Ay, Ao, --> Ay, are any constants, equal or unequal, and 
C1, Cn “°C, are any positive integers whose sum is n, there exist pairs of 
bilinear forms in 2n variables, the second form in each pair being non- 
singular, which have the elementary divisors 


(2) COs OS ee Ome 


The proof of this theorem consists in considering the pair of 
bilinear forms 


ey e ete, eye, 
PS (Src, ae Soy) (“Sana a os Via : 


CN Rag he $ : 
a BS na Yit pee ath), 


a = 2191 as Lo Yo, shay cts TrY no 
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of which the second is non-singular. These forms have a \-matrix 
which may be indicated, for brevity, as 


(4) Srp ee gee 


5 


where the letters M,, --- M, represent not single terms but blocks of 
terms ; M, standing for the matrix of order e; 


Moe 1g Oh es 0 
ae ees Sing a aaa 
M; = 5 . . . . . . e : . . 3 
0 0 0 ees A; —X 


while all the terms of the matrix (4) are zero which do not stand in 
one of the blocks of terms M;. The elementary divisors of (4) are, 
as we see by a reference to § 93 (Formula (1) and Theorem 2), pre- 
cisely the expressions (2). Thus our theorem is proved. 

A reference to § 97 shows that formula (3) is a normal form to 
which every pair of bilinear forms in 2” variables with the ele- 
mentary divisors (2) can be reduced.* 


* Many other normal forms might be chosen in place of (8). Thus, for instance, 
we might have used in place of (3) the form 
a é,—1 e,+é, ¢,+e,—1 
o= (2rrceie,—i41 + Zdyeie,-+) + (Srycytilar rey ae 2autatctert) 
1 e,+1 


f soe (Srvc. iY 9n—ep—t+-1 + me), 
n-ex+1 nm—ext 


3’) ey e+e, €,+es te, 
y= owt, Sil Zeit itl + Leslie, +2e,+e,-it1 
e,tegtl 


5 ie a Deuei Gennes 
n—ez+1 
where the constants ¢1, «++ Cy, di, --- dz may be chosen at pleasure provided, merely, 
that none of them are zero. For instance, they may all be assigned the value 1, 
U 
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Let us now return to the classification of pairs of bilinear forms. 
For a given number, 2 n, of variables we have obviously only a finite 
number of categories. We may subdivide these categories into 
classes by noticing which, if any, of the elementary divisors corre- 
spond to the same linear factor. This we can indicate in the char- 
acteristic by connecting by parentheses those integers which are the 
degrees of elementary divisors corresponding to one and the same 
linear factor. Thus, in the case n = 8, the characteristic 


[2 1)(1.1 1)2] 


would indicate that the ’-matrix has just three distinct linear fac- 
tors; that to one of these there correspond two elementary divisors 
of degrees two and one respectively, to another three elementary 
divisors of the first degree, and to the last a single elementary 
divisor of degree two. 

Two pairs of bilinear fore which are equivalent belong neces- 
sarily to the same class, but two pairs of bilinear forms which be- 
long to the same class are not necessarily equivalent. 

To illustrate what has just been said, let us again consider the 
case n=3. Here we have now, instead of three categories, six 
classes, which are exhibited in the following table: 


a b he 
il (fa 0) heh ya) Cr al 
II. [21] [(21)] 
III. [3] 




















The \-matrix of this pair of forms may be written in the form (4), where, how- 
ever, M; now stands for the matrix of order e;: 


0 o- 0 d; (A; — d) 
MS hic ee eee ae eal 
D1Qy han 0 0 


It will be noticed that the matrices M,, and therefore also the bilinear forms (8/), 
are symmetrical, a fact which will make this normal form important when we come to 
the subject of quadratic forms in the next chapter. 

Constants similar to the constants ¢; and d; which we have introduced in (8') 
might also have been introduced in (8).. 
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The three classes Ia, Ib, Ie form together the category I, and are 
all represented by the normal form given for that category above, 
the only difference being that in class Ia the three quantities X,, Xo 
Ag are all distinct, in class Id two, and only two, of them are equal, 
while in class Ie they are all equal. Similarly category II is now 
divided into two classes, Ila and I18, for both of which the normal 
form of category II holds good, A, and A, being, however, different 
in that normal form for class IIa and equal for class IIb. Finally 
category III consists of only a single class. 

For some purposes it is desirable to carry this subdivision still 
farther. The second of our two bilinear forms, vr, has been assumed 
throughout to be non-singular. The first, ¢, may be singular or 
non-singular; and it is readily seen that a necessary and sufficient 
condition that ¢ be singular is that one, at least, of the constants 2, 
which enter into the linear factors of the \-matrix be zero. Thus it 
_ will be seen that in a single class we shall have pairs of bilinear 
forms both of which are non-singular and others one of which is 
singular, and we may wish to separate into different sub-classes 
the pairs of forms which belong to one or the other of these two 
cases. 

Let us go a step farther in this same direction, and inquire how 
the rank of ¢ is connected with the values of the constants r,. We 
notice that the matrix of ¢ is equal to the matrix of the pencil d—Ap 
when A=0. Accordingly, if ¢@ is of rank r, every (r+ 1)-rowed 
determinant of the matrix of 6 —Aw will be divisible by X, while at 
least one r-rowed determinant of this matrix is not divisible by 2. 
It is then necessary, as we see by a reference to the definition of 
elementary divisors (cf. the footnote to Definition 38, § 92), that just 
n—r of the constants >; which enter into the elementary divisors 
should be zero. Since the converse of these statements is also true, 
we may say that a necessary and sufficient condition that the form 
be of rank r is that just n—r of the elementary divisors be of the form 
r%. Let us, in the characteristic [e, e, --- e,], place a small zero 
above each of the integers e,; which is the degree of such an elemen- 
tary divisor; and regard two pairs of bilinear forms as belonging to. 
a single class when, and only when, their characteristics coincide in 
the distribution of these zeros as well as in other respects. Here 
again two equivalent pairs of forms will always belong to the same 
class, but the converse will not be true. 
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As an illustration, let us again take the casen=3. We have 
now fourteen classes instead of six. 


[111], [4 11}, (411), (21), [2], (3), (r=3), 
(111), (4.1) 21) 21, 8) (r = 2), 
[d 4) 1}, (24), (r=1), 
[didy, (ie 


We have indicated, in each case, the rank 7 of the form ¢. Thus 
in the first six cases ¢ is non-singular; in the next five it is of rank 
2, etc. 

EXERCISES 

1. Prove that there exist pairs of real bilinear forms in 2 n variables of which 

the second is non-singular, and which have the elementary divisors 

(A-Ay)%, (A= Ag) tees (A-Ax)% (4, +@,+ ++ +e,=n), 
provided that such of these elementary divisors as are not real admit of arrange- 
ment in conjugate imaginary pairs. (Cf. Exercises 1,-2, § 93.) 


2. Classify pairs of real bilinear forms in six variables (the second form 
in each pair being non-singular), distinguishing between real and imaginary 
elementary divisors. “ 


100. Classification of Collineations. ‘The classification of pairs 
of bilinear forms which we gave in the last section may obviously 
be regarded, from a more general point of view, as a classification 
of pairs of matrices, the second matrix of each pair being assumed 
to be non-singular. From this point of view it admits of applica- 
tion to the classification of collineations, since, as we saw in § 98, to 
every collineation corresponds a pair of matrices of which one is 
non-singular, namely the unit matrix I and the matrix of the linear 
transformation. Moreover, the normal form (3) of § 99 is precisely 
adapted to the treatment of the more special kind of equivalence 
which we have to consider here, since the matrix of the form is 
precisely the unit matrix. We may therefore state at once the 
fundamental theorem : 


THEOREM 1. Jf dy, Ag +++ Ay are any constants, equal or unequal, 
and 1, €, ++ &, any positive integers whose sum is n, there exists a col- 
lineation in space of n—1 dimensions whose characteristic matrix has 
the elementary divisors 


(% Ags: OME Yee (A — Ay). 
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To this we may add 


THEOREM 2. very collineation of the kind mentioned in Theorem 
1 ts equivalent to the collineation whose matrix is 


5 





where M;, stands for the matrix of order e,, 
ri if! 0 Shee 0 


0 nr; 1 cieit 0 
M, = 


Oe Oe 0. 2 
We-thus get a classification of collineations into categories and 
a subdivision of these categories into classes precisely as in § 99. 


For instance, in the case n = 3 (collineations in the plane), we have 
three categories whose characteristics and representative normal 


forms we give: a, 
es t= Nats 
ee Nata: 


i NP 4 Le 
ae (2 1] pe 425 


pas 
Lz = AgX3y 


iI, [8] ra Ay %_ + Ly 


These categories we should then subdivide either into six classes 
as on page 290 or into fourteen classes as on page 292. This latter 
classification is the desirable one in this case. We proceed to give a 
list of these fourteen classes with a characteristic property of each. 
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That the normal forms of the collineations have these properties 
will be at once evident, and from this it follows that all the collinea- 
tions of the class have the property in question, since the properties 
mentioned are obviously all projective. That the properties men- 
tioned are really characteristic properties, that is, serve to distin- 
guish one class from another, can only be seen a posteriori, by noticing 
that no one of the properties mentioned is shared by two classes. 
[111] Three distinct non-collinear fixed points.* 


[(11)1] Every point of a certain line and one point not on 

this line are fixed. 

[(111)] The identical collineation. 

[21] Two distinct fixed points. 

[(21)] Every point of a certain line is fixed. 

[3] One fixed point. 

In all these cases the collineation is non-singular. The remain- 
ing collineations are singular. In the next three, one point P of the 
plane is not transformed at all, while all other points go over on to - 
a line » which does not pass through P, and every one of whose 
points corresponds to an infinite number of points. 


pi 1] There are two fixed points on p. 

[«1) 1] Every point on p is fixed. 

[2 1] One fixed point on p. 

In the next two cases one point P is not transformed at all, 
while all other points go over on to a line p which passes through P, 


and every one of whose points corresponds to an infinite number of 
points. 


0 
[21] One fixed point. 
0 
[3] No fixed point. 
The remaining collineations are so simple that they are not merely 
characterized, but completely described, by the property we mention. 


0 0 
[(11)1] The points on a certain line are not transformed. All 
other points go over into a single point which does 
not lie on this line. 


* Tt should be understood here and in what follows that the fixed points which 
are mentioned are the only fixed points of the collineation in question. 
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00 
[(21)] The points on a certain line are not transformed. All 
other points go over into a single point on this line. 


000 
{(111)] No point in the plane is transformed. 
This last case is of course not a transformation at all. 


EXERCISES 


1. Classify, in a similar manner, the projective transformations in one 
dimension. 


2. Classify the collineations in space of three dimensions. 


3. Classify the real projective transformations in space of one, two, and three 
dimensions. (Cf. Exercises 1,2, § 99.) 


CHAPTER XXII 


THE EQUIVALENCE AND CLASSIFICATION OF PAIRS OF 
QUADRATIC FORMS 


101. Two Theorems in the Theory of Matrices. In order to jus- 
tify the applications we wish to make of the theory of elementary 
divisors to the subject of quadratic forms, it will be necessary for us 
to turn back for a moment to the general theory of matrices. 


DeFINiTION. Jf G(x) is a polynomial : 
G(r) = aya" + aa fan E+ Ag 
then AyX” + a,x" 14 +. 4 4y 1X + Ayl 
is called a polynomial in the matrix x and is denoted by $(x).* 


We come now to one of the most fundamental theorems in the 
whole theory of matrices: 

THEOREM 1. Jf ais a matrix, and (A) its characteristic fune- 
tion, then (a) = 0. 

This equation is called the Hamilton-Cayley equation. 

Let c be the characteristic matrix of a: 

c=a-—Al. 

This being a A-matrix of the first degree, its adjoint C will be a 

A-matrix of degree not higher than n—1, if n is the order of the matrix a: 


(1) Cac rt ne ese ce 
We may also write 
e. P(A) = hd” + Ie yN 1  o + hy 


Now referring to formula (5), § 25, we see that 
aC — AC = *P(AIJI. 


* It should be noticed that, according to this definition, the coefficients of a poly- 
nomial in x are scalars. Contrast this with a \-matrix, in which the coefficients are 
matrices and the variable a scalar. Both of these conceptions would be included in 
expressions of the form : 

aoX™Do + arx™—1by + ++ + Am—1XDm—1 + Am. 
296 
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Substituting here from (1) and (2), we have, on equating corre- 
sponding powers of a aC, eat 
aC, = 50, = 41, 


ACA Ch eT, 
=e C,-4 = els 


If we multiply these equations in succession by I, a, a2, --- a”, 
and add, the first members cancel out, and we get 


Kol + kya + kyat®+ --- +h,” = 0. 


This is precisely the equation 
ola) = 0 
which we wished to establish. 
As a means of deducing our second theorem, we next establish a 
lemma which relates merely to scalar quantities. 


LemMA. Jf W(x) is a polynomial of the nth degree (n>0) whose 
constant term is not zero, there exists a polynomial x(x) of degree less 
val . sh that (x(a) P— 2 
is divisible by Wx). 

Let x—a, x—b, x—c, +» be the distinct linear factors of yz), 
so that we may write 


f(r) = Kx —a)(a— b)(x—e)Y + (a+B+y+ + =n). 


None of the constants a, 6, ¢, --- are zero, since, by hypothesis, the 
constant term of y is not zero. Let us, further, denote by y,(x) the 
polynomial obtained from by omitting the factor (w—a)*, by 
a(x) the polynomial obtained from y by omitting the factor (w7— bye, 
etc., and finally let us form, with undetermined coefficients, the 
polynomials 

A(z) = A, +A,(x— a) + Afa—aP+ + +A,y(2- 4), 

B(x) = B, + Bila —b) + Blv— bP + ++ + Be(x— 6), 

Oax)= C+ O(e—e)+ Of2—eP+ ve + C1-(@ — ey}, 
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From these polynomials we now form the polynomial 
x(x) = AC@)i(2) + Bele) + Cape) + - 


whose degree can obviously not exceed n—1. We wish to show 
that the coefficients A, B, --- can be so determined that this poly- 
nomial X(«) satisfies the conditions of our lemma. 


Since y,, Wr, --- are all divisible by (2—a)*, a necessary and 
sufficient condition that (y(x))? — x be divisible by this factor is that 
the polynomial $a) =(A@)Hi@)—2 


be divisible by (w— a)". We have 
(a) = A2k{a —b)%(a —cP --- —a. 


In order that ¢(x) be divisible by x — ait is therefore necessary and 
sufficient that is 
oe : 

®) AO aKa — oh 

Neither numerator nor denominator here being zero, we thus 
obtain two distinct values for A), both different from zero. If we 
give to A, one of these values, ¢(~) is divisible by —a. A neces- 
sary and sufficient condition that it be also divisible by (#—a/ is 
that ¢'(a) = 0, accents here, and in what follows, denoting differentia- 
tion. We shall see in a moment that this condition can be imposed 
in one, and only one, way by a suitable choice of A,. The condi- 
tion that f(z) be divisible by (w — a)? is then simply $/(a)=0. We 
wish to show that this process can be continued until we have finally 
imposed the condition that ¢(a) be divisible by (a#—a)*. For this 
purpose we use the method of mathematical induction, and assume 
that A, --- A, have been so determined that ¢(a)=¢/(a)= -- 
= ¢§1(a)=0. It remains then merely to show that A, can be so 
determined that ¢&(a)=0. For this purpose we notice that 


(4) P(x) = 2AN(@ AC Wy(x)P + B(x) 


where R(x) is an integral rational function with numerical coeffi- 


cients of Wy, Wi, --- Wil, A, A’, --- A’-ML Since 
A(a) = Ay, A'(a)=A,, Al(a) = 21A,, -+» AbAKa) = (8 —1)!4,_4, 


it follows that A(a) is a known constant, that is, that it does not 
depend on any of the still undetermined constants A,,A,.,, --+ A, 
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nor on the B’s,-0’s, etc. Consequently we see from (4) that a neces- 
sary and sufficient condition that $a) = 0 is that A, have the value 
5) Ada Ria) . 

) © 2s8lAy (W,(a)? 

Determining the coefficients A,, Aj, --- A,_, in succession by means 
of this formula, we finally determine the polynomial A(z) in such a 

way that ¢(z) is divisible by (w@—a)*. For this determination, 

(x(x)? — x will, as we saw above, be divisible by (#— a)*. 

In the same way we can now determine the coefficients of B(x) so 
that (x(x))? — 2 is divisible by (2 — 6)®; then we determine the coeffi- 
cients of C(x) so that (y(x))? — 2 is divisible by (a— cc)’; ete. When 
all the polynomials A, B, C, --- are thus determined, (X(x))? — x is 
divisible by ¥(x), and our lemma is proved. 


THEOREM 2. [fais a non-singular matrix of order n, there exist ma- 
trices b of order n (necessarily non-singular) with the following properties : 
b—a, 
b zs a polynomial in a of degree less than n. 
Since a is non-singular, its characteristic function ¢() is a poly- 
nomial of the nth degree whose constant term is not zero. Hence, by 
the preceding lemma, a polynomial y(A) of degree less than m can be 


determined such that (x(X) 2 — 2 = H(A) F(A) 


where f(X) is also a polynomial. From this identity it follows that 
(x(a)? — a= $(a) f(a): 
Since, by Theorem 1, ¢(a) = 0, the last equation may be written 
(x(a) P= a 


so that b= y(a) is a matrix satisfying the conditions of our theorem, 
which is thus proved. | 


102. Symmetric Matrices. The application of the theory of ele- 
mentary divisors to the subject of quadratic forms rests on the fol- 
lowing proposition: 
TuHeorEM 1. Jf a, and a, are symmetric matrices and if there 
exist two non-singular matrices p and q such that 


(o} a, = pa; q, 
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then there also exists a non singular matrix P such that 
(2) a, == P’a,P 
where P! is the conjugate of P.* 

Let us denote by p’ and q’ the conjugates of p and q respectively. 
Taking the conjugates of both sides of (1), and remembering that 
a, and a,, being symmetric, are their own conjugates, we get, by 
Theorem 6, § 22, 

(3) ay = 4/a,p'. 
By equating the values of a, in (1) and (3), we readily deduce the 
further relation 


(4) (q') "pa, = a,p'a 

For brevity we will let 

(5) U=(q'y“p, U'= p'q=? 

and note that U’ is the conjugate of U; cf. Exercise 6, § 25. Equa- 


tion (4) may then be written 

(6) Ua, =a,U’. 

From this equation we infer at once the following further ones : 
Ua, = Ua, U' au 


U's, = eal ge ek 
(7) eee 


U'a, = va,u" toa ut 


Let us now multiply the equations (6) and (7) and also the equa- 
tion a, =a, by any set of scalar constants and add them together. 
We see in this way that if y (U) is any polynomial in U, 


(8) xX(U) a, = ayx(U"). 


* A proof of this theorem much simpler than that given in the text is the following: 

From (1) we infer at once that a; and ay have the same rank. Hence the quad- 
ratic forms of which a; and ay are the matrices are equivalent to each other by Theorem 
4, § 46, If we denote by P the matrix of the linear transformation which carries over 
the quadratic form a, into the form ag, we see, from Theorem 1, § 43, that equation (2) 
holds. 

This proof would not enable us to infer that P can be expressed in terms of p and q 
alone, and this is essential for our purposes, 


A 
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We will choose the polynomial 
: V=xU) 
so that V is non-singular and 
V2=.U, 
as is seen to be possible by Theorem 2, § 101. Denoting by V’ the 
conjugate of V, we evidently have 


V'=x(U'), 


so that we may write (8) in the form 
Vaj—a, V7, 


or a, = Vi ta,V', 


We now substitute this value in (1) and get 
(9) a,=pV 4a,V"q. 


From the first equation (5) we infer the formula 


pV !=q’V. 


Consequently pV! is the conjugate of V’q, so that if we let 
P=V'q, 


equation (9) may be written 
a, = P'a,P; 
and our theorem is proved. 
The proof just given enables us to add the 


CoroLuARy. As the matrix P of the foregoing theorem may be 
taken the matrix V'q where V' is the conjugate of any one of the square 
roots, determined by Theorem 2, § 101, of (q')"!p. 


In particular it will be seen that P depends on p and q but not on 
a, ora, Hence if a,, a, b,, b, are symmetric matrices, and there 
exist two non-singular matrices p and q such that 


a, = pa,q, b, = pb,q, 


then there exists a non-singular matrix P such that 


a, = Pla,P, b, = P’b,P. 


302 INTRODUCTION ‘TO HIGHER ALGEBRA 


From this and Theorem 2, § 96, we infer 

THEOREM 2. Jf aj, a), b,, by, are symmetric matrices of which d,, dy 
are non-singular, a necessary and sufficient condition that a non-singu- 
lar matrix P exist such that 


(10) ° a, = P'a,P, b, = P'b,P, 


where P' ts the conjugate of P, is that the matrices 
a, — Ab, a, — Ab, 


have the same invariant factors, — or, if we prefer, the same elementary 
divisors. 
If, in particular, b, = b, =I, where I is the unit matrix, we have, 
from the second equation (10), the formula 
I=P'P. 


Such a matrix P we call an orthogonal matrix according to the defini- 
tion, which will readily be seen to be equivalent to the one given in 
the first footnote on page 154: ' 


DEFINITION. By an orthogonal matrix we understand a non-singu- 
lar matrix whose inverse is equal to its conjugate. 


In the special case just 3 referred to, Theorem 2 may be stated in the 
following form: 


THEOREM 8. Jf a, and a, are two symmetric matrices, a necessary 
and sufficient condition that an orthogonal matrix P exist such that 


=P ae 
is that the characteristic matrices of a, and a, have the same invariant 
factors, —or, if we prefer, the same.elementary divisors. 
If this theorem is compared with Theorem 38, § 96, it will be seen 
that it differs from it only in two respects, first that a, and a, are 


assumed to be symmetric, and secondly that P is required to be 
orthogonal. 


103. The Equivalence of Pairs of Quadratic Forms. Let us con- 
sider the two pairs of quadratic forms 


13 


n 
— I — 
$¢,= % Mj U, Bp ry = dbf a2, 


2a 


n 
ee " poe I 
and : dy — 2 Ay Ui; Xjy ro = VO wiXyy 
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of which the two forms , and yy, are assumed to be non-singular. 
We will inquire under what conditions these two pairs of forms are 
equivalent ; that is, under what conditions a linear transformation 


Ty = Cy ovr + Cyn hp 
Ln = Cnty + > +E Can y 
exists which carries over ¢, into ¢, and, at the same time, y, into Wy. 

If we denote the conjugate of the matrix c by c’, and the ma- 
trices of the forms ¢,, W,, $a Ww, by a, by, ag, b, respectively, we 
know, by Theorem 1, § 43, that the transformation c carries over ¢, 
and yy, into forms with the matrices 

a,c, c’b,c 


respectively; so that, if these are the forms ¢, and ,, we have 
(1) i 6.8, b,c Dic. 


Consequently, by Theorem 2, § 102, the two A-matrices 
a, — Ab,, a, — Ab, 

have the same invariant factors and elementary divisors. 

Conversely, by the same theorem, if these two A-matrices have 
the same invariant factors (or elementary divisors), a matrix c, inde- 
pendent of A, exists which satisfies both equations (1); and hence - 
the two pairs of quadratic forms are equivalent. Thus we have 
proved 

THEOREM 1. Jf ¢,, W, and dy, Wy are two pairs of quadratic 
forms in n variables, in which ww, and wv, are non-singular, a necessary 
and sufficient condition that these two pairs of forms be equivalent is 
that the matrices of the two pencils 


$,— Ap $2 — Athy 
have the same invariant factors, —or, if we prefer, the same elemen- 
tary divisors.* 
A special case of this theorem which is of considerable impor- 
tance is that in which both of the forms w, and y, reduce to 
A + a oe a3. 


*¥For brevity, we shall speak of these invariant factors and elementary divisors as 
the invariant factors and elementary divisors of the pairs of forms $1, 1 and $2, P2 
respectively. 
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In this case we have to deal with orthogonal transformations (cf. the 
Definition in Exercise 1, § 52), and our theorem may be stated in 
the form * 


THEOREM 2. Jf a, and a, are the matrices of two quadratic forms, 
a necessary and sufficient condition that there exist an orthogonal trans- 
formation which carries over one of these forms into the other is that the 
characteristic matrices of a, and a, have the same invariant factors, 
— or, if we prefer, the same elementary divisors. 


To illustrate the meaning of the theorems of this section, let us 
consider again briefly the problem of the simultaneous reduction of 
two quadratic forms to sums of squares. In Chapter XIII we be- 
came acquainted with two cases in which this reduction is possible ; 
cf. Theorem 2, § 58, and Theorem 2,§ 59. We are in a position now 
to state a necessary and sufficient condition for the possibility of 
this reduction, provided that one of the two forms is non-singular. 

For this purpose, consider the two quadratic forms 


b= khyxt + kyxe + +--+ + kx, 

ar = 0,07 + C42 + «-- + 6,23, 
where we assume, in order that the second form may be non-singular, 
that none of the c’s vanish. The matrix of the pencil ¢ — Ap is 


k,—¢r 0 i ae 0 
0 kj-ery 0 -: 0 


0 0 QO + ky—e,r 
and the elementary divisors of this matrix are 


(ok, Nohaeeens ye 
Cy Co Cn 
all of the first degree. Consequently, any pair of quadratic forms 
equivalent to the pair just considered must have a X-matrix whose 
elementary divisors are all of the first degree. 
Conversely, if we have a pair of quadratic forms, of which the 
first is non-singular, whose A-matrix has elementary divisors all of 


* This theorem is, of course, essentially equivalent to Theorem 3, § 102, of which it 
may be regarded as an immediate consequence. 
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the first degree, we can obviously choose the constants k and e¢ 
in such a way that the A-matrix of the forms ¢ and w just con- 
sidered has these same elementary divisors, and therefore the given 
forms are equivalent to these special forms ¢ and y~. Thus we have 
proved the theorem : 


THEOREM 3. If ¢ and w are quadratic forms and wp is non-singu- 
lar, a necessary and sufficient condition that it be possible to reduce > 
and xy simultaneously by a non-singular linear transformation to forms 
into which only the square terms enter is that all the elementary divisors 


of the pair of forms be of the first degree. 


This theorem obviously includes as a special case Theorem 2 of 
§ 58, since the elementary divisors are necessarily of the first degree 
when the A-equation has no multiple roots. 

Comparing the theorem just proved with Theorem 2, § 59, we see 
that under the conditions of that theorem the elementary divisors 
must be of the first degree. Hence 


THEOREM 4. Jf wis a non-singular, definite, quadratic form, and 
¢ is a real quadratic form, all the elementary divisors of this pair of 
forms are necessarily of the first degree. 


104. Classification of Pairs of Quadratic Forms. We consider 
the pair of quadratic forms 


(1) o= LAU = 2b jx iyp 


and assume, as before, that y is non-singular. We denote the ele- 
mentary divisors of these forms, as in § 99, by 


(A —Ay)*; (AX — Ag) as (A— Ne) (e, +e,+ + $e, =). 


The symbol [e, e, ---e,] we call the characteristic of the pair of 
quadratic forms; and all pairs of quadratic forms which have the 
same characteristic we speak of as forming a category.* 

We have here, precisely as in the case of bilinear forms, the 
theorem: 


THEOREM. If r,, Ay - Ay are any constants, equal or unequal, 
and €;, ey, +++ e,are any positive integers whose sum is n, there exist pairs 


* Thus, for instance, all pairs of forms of which the second is non-singular and which 
admit of simultaneous reduction to sums of squares, form a category whose character- 
istic is [1 1---1]. Cf. Theorem 38, § 103. 

ze 
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of quadratic forms in n variables, the second form in each pair berg 
non-singular, which have the elementary divisors 


(2) (A — ry) (N— Ag) ore (AX — ry) 


The proof of this theorem consists in considering the following 
pair of quadratic forms, analogous to the normal form (3') of § 99: 


ey QL e+e, €;+e,-1 
* Peas, ae = AX, Le-1 ) + (= Rolo (U2, + 69-11 e z def Tete -) 
1 1 etl 
Bee: +( > Apll Lon—e, 41 + = dee —p-0,-4) 
(3) ne, +1 n—ep +1 
ey +e, erbeates 
v= Boye Voi] Ds é ae Voete—1ty+ & Cg%iXne+oertes-it 
ey tegt1 
n 
toh FL CL one ity? 
n—ep+1 


where ¢,, +++ ¢, dy, --- d, are constants which may be chosen at pleas- 
ure, provided none of them are zero. 

The A-matrix of this pair of forms is the same as the \-matrix of 
the pair of bilinear forms (3’) of § 99, and therefore has the desired 
elementary divisors. 

A reference to Theorem 1, § 103, shows that formula (3) yields a 
normal form to which every pair of quadratic forms, of which the 
second is non-singular and whose elementary divisors are given by 
(2), can be reduced. 

The categories, of which we have so far spoken, may be divided 
into classes by the same methods we used in § 99 in the case of 
bilinear forms. This may be done, as before, either by simply noting 
which of the ,’s are equal to each other, or by further distinguishing 
between the cases where some of the X,’s are zero. 

We are now in a position to see exactly in what way our elemen- 
tary divisors give us a more powerful instrument than we had in the 
invariants ©, of § 57. These invariants ©, being the coefficients of 
the \-equation of our pair of forms, determine the constants 2,, which 
are the roots of this equation, as well as the multiplicities of these 
roots. They do not determine the degrees e; of the elementary di- 
visors, and the use of the @,’s alone does not, in all cases, enable us 
to determine whether two pairs of forms are equivalent or not. 
Thus, for instance, we may have two pairs of forms with exactly the 
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same invariants ©; but with characteristics [(11)11.-+--1] and 
[211 .--- 1] respectively.* It will be seen, therefore, that the 0,’s 
form in only a very technical sense a complete system of invariants. 


EXERCISES 


1. Form a numerical example in the case n =8 to illustrate the statement 
made in the next to the last sentence of this section. 


2. Prove that if two equivalent pairs of quadratic forms have two elementary 
divisors of the first degree which correspond to the same linear factor, there exist 
an infinite number of linear transformations which carry over one pair of forms 
into the other. 


3. Prove the general theorem, of which Exercise 2 is a special case, namely, 
that if two equivalent pairs of quadratic forms have a characteristic in which one 
or more parentheses appear, there exist an infinite number of linear transforma- 
tions which carry over one pair of forms into the other. 


4. Prove that if two equivalent pairs of quadratic forms have a characteristic 
in which no parentheses appear, only a finite number of linear transformations 
exist which carry over one pair of forms into the other. ¢ 

How are these transformations related to each other ? 


105. Pairs of Quadratic Equations, and Pencils of Forms or Equa- 
tions.t In dealing with quadratic forms, the questions of equiva- 
lence and classification do not always present themselves to us in 
precisely the form in which we have considered them in the last two 
sections. We frequently have to deal not with the quadratic forms 
themselves but with the equations obtained by setting the forms 
equal to zero. Two such pairs of equations we shall regard as 
equivalent, not merely if the forms in them are equivalent, but also 
if one pair of forms can be obtained from the other by multiplication 
by constants different from zero. 

Let us consider two quadratic forms ¢, Ww, of which we assume, 
as before, that the second is non-singular, and inquire what the 
effect on the elementary divisors 


(1) Oe ey Wa See (0-24) 


* We may, in the case n =38, put the same thing geometrically (cf. the next sec- 
tion) by saying thatit is impossible to distinguish between the case of two conics having 
double contact and that of two conics having simple contact at asingle point by the use 
of the invariants @; alone, whereas these two cases are at once distinguished by the use 
of elementary divisors. « 

+ The exercise in § 58 is practically a special case of this. 

t Questions similar to those treated in this section might have been Taken up in 
the last chapter for the case of bilinear forms. 
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of these forms will be if the forms are multiplied respectively by the 
constants p, g which are both assumed to be different from zero. 


Let us write 
$,=Ppp, V1 =". 


Then 
(2) $, — Av, =p ($— rh) 
where v= TX: 


iy 

Let \ — « be any one of the linear factors of the matrix of ¢ — Ap, 
so that @ is any one of the constants Aj, A, --- Az; and let us denote, 
as in the footnote to Definition 3, § 92, by J; the exponent of the 
highest power of %—a which is a factor of all the 7-rowed deter- 
minants of this matrix. Then it is clear, from (2), that 7; is the 
exponent of the highest power of ’ — « which is a factor of all the 
t-rowed determinants of the matrix of ¢,—Ay,. In other words, 


oe 
qg 


is the highest power of the linear factor \ — pa/q which is a factor 
of all the 7-rowed determinants of the matrix of ¢,—Ayw,. Turning 
now to the definition of elementary divisors as given in the footnote 
to Definition 3, § 92, we see that the elementary divisors of the matrix 
of d, — AW, differ from those of the matrix of ¢— Aw only in having 
the constants A; replaced by the constants pA,/g. We thus have the 
result : 


THEOREM 1. Jf the pair of quadratic forms $, , of which the 
second is assumed to be non-singular, has the elementary divisors 
(Ms Qayy nee (X-ray, 


and if p, q are constants different from zero, then the pair of quadratic 
forms ph, qv has the elementary divisors 


(X = Mts (r 3 do) Nee ob (r = ry)*e 
where A= L XS 
q 


In particular, it will be seen that these two pairs of forms have 
the same characteristic, even when the conception of the character- 
istic is refined not merely by inserting parentheses but also by the 
use of the small zeros. 
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The theorem just proved shows that pairs of homogeneous 
quadratic equations, of which the second equation in each pair is 
non-singular, may be classified by the use of their characteristics 
precisely as was done in the last section for pairs of quadratic forms. 
We proceed to illustrate this in the case n = 3, where we may con- 
sider that we have to deal with the classification of pairs of conics 
in a plane, one of the conics being non-singular. 

We have here three categories represented by the following 
‘normal forms :* 
pe ti] P HAW + Agey — Ayers 


ety = Ge 
p= 24+23 ae. 


U. [21] Le a eet 
w= 22,2,4+ 23. 
Il. [3] ¢= ee Ly Ly + A423 + 2042, 
i pe 22,7, + 23. 


We next subdivide these categories into classes, and, by an ex- 
amination of the normal form in each case, we are enabled at once to 
characterize each class by certain projective properties which it has, 
and which are shared by no other class.} Since the conic > is non- 
singular in all cases, this fact need not be explicitly stated. 


[111] q¢and wy intersect in four distinct points. 

[(11)1] ¢ and w have double contact. 

[(111)] ¢ and wp coincide. 

[21] ¢ and w meet in three distinct points at one of which 

they touch. 

[(21)] and wy have contact of the third order. 

[3] ¢ and wy have contact of the second order. 

In all of the above cases ¢, as well as wf, is non-singular. 

In the next five cases, ¢ consists of a pair of distinct straight 
lines. 


* We assign to the constants c; and k;, in formula (3) of the last section, values so 
chosen that the loci ¢ =0, y =0 are real when the constants \; are real. This is, of 
course, not essential, since we are not concerned with questions of reality. 

+ In order to verify the statements made below, the reader should have some 
knowledge of the theory of the contact of conics; cf. for instance Salmon’s Conic 
Sections, Chapter XIV., pages 232-238. 
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0 
[aed zl ¢ and intersect in four distinct points. 
[a a) 1] Both of the lines of which ¢ consists touch p. 


[2 1] One of the lines of which ¢ consists touches Ae while 
the other cuts it in two points distinct from the point 
of contact of the first. 


0 
[21] The two lines of which ¢ consists intersect on yz, and 
neither of them touches w. 


0 
[3] The two lines of which ¢ consists intersect on y, and 
one of them touches wf. 


In the next two cases, ¢ consists of a single line. 

r( 1) 1] The line ¢ meets ¥ in two distinct points. 

(2 1)] The line ¢ touches yp. 

Finally we have the case: 

[a 1 1]) Here ¢=0, and we have no conic other than y. 


Suppose finally that we wish to classify not pairs of quadratic forms 
or equations but pencils of quadratic forms or equations. Consider the 
pencil of quadratic forms $b — dp 


where ¢ and w are quadratic forms, and y is non-singular, and 
suppose that the elementary divisors of the pair of forms ¢, W are 
given by formula (1) above. The question presents itself whether, 
if, in place of the forms ¢, ~, we take any other two forms of the 


Bona) $= $— yp, Pi=o—ryp, 


the constants , v being so chosen that wv and that yr, is non- 
singular, the pair of forms ¢,, W,, will have these same elementary 
divisors (1). If this were the case, we could properly speak of (1) 
as the elementary divisors of the pencil. This, however, is not the 
case, and the pencil of quadratic forms cannot properly be said to have 
elementary divisors.* 


* We here regard the pencil as merely an aggregate of an infinite number of 
quadratic forms, namely, all the forms which can be obtained from the expression 
% — dy by giving to d different values. In this sense we cannot speak of the elementary 
: eeu, of the pencil, If, however, we wish to regard the polynomial in the x’s and X, 

— ry, as the pencil, we may speak of its elementary divisors, meaning thereby simply 
Fie we have called the elementary divisors of the pair of forms ¢, yw. 


PAIRS OF QUADRATIC FORMS 311 


There is, however, a simple relation between the elementary 
divisors of two pairs of forms taken from the same pencil. In order 
to show this, let us détermine the elementary divisors of the pair of 
forms ¢,, ,, above. For this purpose consider the expression 
$, — AW, which, when 7 #1, may be written 





(3) $,— Av, = —A)[o—- Mar] 
where aie : = : 


Now suppose, as above, that \ — a is any one of the linear factors of 
the matrix of ¢—)Aw, and that J; is the exponent of the highest 
power of ) — e which is a factor of all the 7-rowed determinants of 
this matrix. Then any one of the ¢-rowed determinants of the ma- 
trix of @6— A’ may, when 1 # 1, be written in the form 
Qn! — aif") 

where f is a polynomial in 0’ of degree not greater than 7 — ];. 
Accordingly, by (3), the corresponding ¢-rowed determinant of the 
matrix of ¢, —Ay, may be written 


[w — va — «(1 — 2) Jif) 


where f, is a polynomial in X. Thus we see that 


a ph li 
eal 
is a factor of every t-rowed determinant of the matrix of ¢, — Ay. 
Similar reasoning, carried through in the reverse order, shows that 
this is the highest power of Ate et 
a—v 
which is a factor of all these 7-rowed determinants. Hence 
THEOREM 2. Jf the pair of quadratic forms $, Wy, of which the 
second is non-singular, have the elementary divisors 
(A — )4)*5 OX Rhy) tea ttt (A — Az)P*s 
and if u,v are any two constants distinct from each other and such 
that v is distinct from all the constants d1, Ags +++ Az then the two forms 


d=o-—eh h=h—vy 
of which the second will then be non-singular, will have the elementary 
divisors (A cat Xs, (A ie My ae (A— ni, 


where N= Sok (¢=1, 2, -:- &). 
= 
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In particular, it will be seen that the two pairs of forms ¢, w and 
$1, W, have the same characteristic [e, e, --- e,] even if we put in 
parentheses to indicate which of the e’s correspond to equal X,’s. 
The characteristics will not, however, necessarily be the same if we 
put in small zeros to indicate which of the e’s correspond to vanish- 
ing A,’s, since A; and A} do not usually vanish together. Accord- 
ingly, in classifying pencils of quadratic forms, we may use the 
characteristic of any pair of distinct forms of the pencil, the second 
of which is non-singular, but we must not introduce the small zeros 
into these characteristics. This classification, of course, applies only 
to what may be called non-singular pencils, that is, pencils whose 
forms are not all singular. 

It will readily be seen that what has just been said applies with- 
out essential change to the case of pencils of homogeneous quadratic 
equations. We may therefore illustrate it by the classification of 
non-singular pencils of conics.* We have here six classes of pencils 
which we characterize as follows: 

[111] The conics all pass through four distinct points. 

[(11)1] The conics all pass through two points at which they 

have double contact with each other. 

[(111)] The conics all coincide. 

[2 1] The conics all pass through three points at one of 

which they touch one another. 

[(2 1)] The conics all pass through one point at which they 

have contact of the third order. 

[3] The conics all pass through two points, at one of 

which they have contact of the second order. 


EXERCISES 
1. Determine, by the use of elementary divisors, the nature of each of the 
following pairs of conics: 
(a) ee + 8a122—10 x2.243+4 2,23 =0 
2a7+3 03-2344 2192-— 6 x2%3+ 62123 = 0. 
(0) as Bia ie 3.13 — 3 2129 +8 273 + 1123 —0 
2aj+ag— 23-2 21x22 —2 roa +2 2123 =0. 
2. Give a classification of pairs of binary quadratic equations, the second 
equation of each pair being non-singular, and interpret the work geometrically. 


* For a similar classification of pencils of quadrics we refer to p. 46 of Bromwich’s 
book: Quadratic Forms and their Classification by Means of Invariant Factors. 
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106. Conclusion. We wish, in this section, to point out some 
of the important questions connected with the subject of elementary 
divisors, which, in order to keep our treatment within proper limits, 
we have been obliged to leave out of consideration. 

If ¢, yw, and ¢, Ww, are two pairs of bilinear or quadratic 
forms of which y,, , are non-singular, we have found a method 
of determining whether these two pairs of forms are equiva- 
lent or not. If we use the invariant factors instead of the ele- 
mentary divisors, our method involves only the use of the rational 
operations (addition, subtraction, multiplication, and division). 
and can, therefore, be actually carried through in any concrete 
case. In fact we have explained in § 93 some really practical 
methods of determining the invariant factors of a A-matrix, so 
that the problem of determining whether or not two pairs of 
bilinear or quadratic forms, the second form in each pair being 
non-singular, are equivalent, may be regarded as solved, not 
merely from the theoretical, but also from the practical point of 
view. 

There is, however, another question here, which we have not 
treated, namely, if the two pairs of forms turn out to be equiva- 
lent, to find a linear transformation which carries over one into 
the other. This problem, too, we may consider that we have 
solved from a theoretical point of view; for the proof we have 
given that if two pairs of forms have the same elementary 
divisors there exists a linear transformation which carries over 
one pair of forms into the other, consisted, as will be seen on 
examination, in actually giving a method whereby such a linear 
transformation could be determined. In fact, in the case of bilinear 
forms, the processes involved are, here again, merely the rational 
processes; so that, given two equivalent pairs of bilinear forms, the 
second form of each pair being non-singular, we are in a position to 
find, in any concrete case, linear transformations of the 2’s and y’s 
which carry over one pair of forms into the other. Even here the 
arrangement of the work in a practical manner might require 
further consideration. 

In the case of quadratic forms the problem becomes a much more 
difficult one, inasmuch as the processes involved in the determination 
of the required linear transformation are no longer rational; cf. the 
Lemma of § 101. That this is not merely a defect of the method we 
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have used, but is inherent in the problem itself, will be seen by a con- 
sideration of simple numerical examples. Let, for instance, 


$, = 2x74 323, b, = 242 — 323, 

Wi= vit x ho= vi— 2}. 
Here the pairs of forms ¢,, y, and ¢,, y, both have the elementary 
divisors 1-2 1g 


and are therefore equivalent. The linear transformation which 
carries over one pair of forms into the other cannot, however, be 
real (and therefore its coefficients cannot be determined rationally 
from the coefficients of the given forms) since ¢, and y, are definite, 
¢, and vf, indefinite. 

We have, therefore, here the problem of devising a practical 
method of determining a linear transformation which carries over a 
first pair of quadratic forms into a second given equivalent pair. A 
method of this sort, which is a practical one when once the elemen- 
tary divisors have been determined, will be found in Bromwich’s 
book on quadratic forms referred to in the footnote on p. 312. 

Another point at which our treatment is incomplete is in the 
restriction we have always made in assuming that, in the pair of 
bilinear or. quadratic forms ¢, y, the form y is non-singular. Al- 
though this is the case in many of the most important problems to 
which one wishes to apply the method of elementary divisors, it is still 
a restriction which it is desirable to remove. ‘This may be done in, 
part by making use not, as we have done, of the pencil 6—Ay, but 
of the more general pencil wd — ry, w and » being variable param- 
eters. The determinants of the matrix of this pencil are binary 
forms in (, X), and the whole subject of elementary divisors admits 
an easy extension to this case, the elementary divisors being now 
integral powers of linear binary forms. The only case which can- 
not be treated in this way is that in which not only ¢ and w are 
both singular, but every form of the pencil wd—AvW is singular. 
This singular case, which was explicitly excluded by Weierstrass in 
his original paper, requires a special treatment which has been given 
by Kronecker. Cf., for the case of quadratic forms, the book of 
Bromwich already referred to. 

Still another question is the application of the method of ele- 
mentary divisors to the case in which the two forms ¢, w are real, 
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and only real linear transformations are admitted. In the case of 
bilinear forms, this question presents no serious difficulty; cf. the 
exercises of §$ 97, 99. In the case of quadratic forms, however, the 
irrational processes involved in the proof of the Lemma of § 101 
introduce an essential difficulty, since they are capable of introduc- 
ing imaginary quantities. Moreover, this difficulty does not lie 
merely in the method of treatment. The theorems themselves 
which we have established do not remain true, as is seen by a refer- 
ence to the numerical example given earlier in this section for an- 
other purpose, where we have two pairs of real quadratic forms 
which, although they have the same elementary divisors, are not 
equivalent with regard to real linear transformations. 

We must content ourselves with merely mentioning this impor- 
tant subject, and referring, for one of the fundamental theorems, to 
p- 69 of the book of Bromwich. 

For further information concerning the subject of elementary 
divisors the reader is referred to Muth’s Theorie und Anwendung der 
Elementarthetler, Leipzig, Teubner, 1899. In English, the book of 
Bromwich already referred to and some sections in Mathews’ revision 
of Scott’s Determinants will be found useful. 
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symmetric functions, 255. 
Biquadratic, binary, 260. 
Boole, 260. 
Bordered determinants, 28, 156-160. 
Bromwich, 270, 312-315. 
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of a pair of quadratic forms, 305. 
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of quadratic forms, 148. 
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law for matrices, 63. 
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Complete system of invariants, 93. 
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Descriptive property, 88, 232. 
Determinant, 20; 

adjoint of, 30; 

bordered, 28, 156-160; 

Laplace’s development of, 26; 

matrix of, 21; 

minors of, 22; 

of a bilinear form, 114; 

of a matrix, 21; 

of a transformation, 66; 

orthogonal, 154; 
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Determinant — Continued. 
product of two, 26; 
rank of, 22; 
skew-symmetric, 59; 
symmetric, 56. 
Discriminant of a binary biquadratic, 260; 
of a binary cubic, 239; 
of a binary form, 237, 259; 
of a polynomial in one variable, 250; 
of a quadratic form, 128; 
of a quadric surface, 118. 
Division of polynomials, 180; 
of \-matrices, 278; 
of matrices, 75. 
Divisor of zero, 65, 80. 
Domain of rationality, 175, 212, 216. 
Dyad (dyadic polynomial), 79. 
Dyalitic method of elimination, 199. 


Element of a determinant or matrix, 20; 
of a set, system, or group, 81. 
Elementary divisors of a \-matrix, 271; 
of a collineation, 286 ; 
of a pair of bilinear forms, 284; 
of a pair of quadratic forms, 303; 
simple, composite, 270. 
Elementary symmetric function, 242, 253. 
Elementary transformation of a matrix, 55; 
of a \-matrix, 262. 
Elimination, 198, 217, 238. 
Equations, linear, 43; 
homogeneous, linear, 47; 
quadratic, 149; 
quadratic, pairs of, 307; 

* quadratic, pencils of, 312. 
Equianharmonice points, 107. 
Equivalence, 92. 

Equivalent matrices, 55, 93; 
collineations, 286; 
A-matrices, 263, 274; 
pairs of bilinear forms, 283; 
pairs of matrices, 281; 
pairs of quadratic forms, 170, 303; 
quadratic forms, 135, 148. 
Euclid’s algorithm, 189, 192, 206. 
Euler’s theorem for homogeneous functions, 
237. 


Factors of a polynomial, 174, 187, 203; see 
invariant factors. 
Fixed points of a collineation, 285; 
lines, planes of a collineation, 286. 
Forms, 4; see bilinear, quadratic, binary 
forms ; 
biquadratic, 260; 
cubic, 239; 
polar, 127. 
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Fours group, 87. 

Fractional matrices, 86. 

Frobenius, 262, 270. 

Fundamental system of solutions, 49; 
theorem of algebra, 16. 


Generator of a quadric surface, 119. 
Gibbs, 79. 
Greatest common divisor of integers, 188; 
of polynomials in one variable, 191, 197; 
of polynomials in two variables, 206. 
Ground-form, 96. 
Group, 80; 2 
Abelian or commutative, 83; 
cyclic, 87; 
fours group, 87; 
isomorphic, 83; 
sub-, 83. 
Group property, 82. 


Hamilton, 79, 296. 
Harmonie division, 104. 
Homogeneity, principle of, 226. 
Homogeneous coordinates, 11; 
invariants, 230; 
linear equations, 47, 49; 
polynomials, 4. 


Idemfactor, 74. 
Identical vanishing (equality) of polynomi- 
als, 2, 5, 7, 10; . 
element.of a group, 82; 
transformation, 67. 
Indefinite quadratic form, 150. 
Index of inertia of quadratic form, 146. 
Invariant, absolute algebraic, 89; 
arithmetical, 91, 94, 115, 124, 129, 146, 287; 
complete system of, 93; 
geometric, 88, 103; 
homogeneous, 230; 
integral rational, 99, 101, 115, 129, 137, 
159, 166, 218, 259, 260; 
irrational, 167, 259, 260; 
rational, 96, 222; see also integral, ra- 
tional; 
relative algebraic, 96, 115; see integral 
rational. 
Invariant factors of a \-matrix, 269; 
of a collineation, 286; 
of a pair of bilinear forms, 284; 
of a pair of quadratic forms, 303. 
Inverse of a transformation, 67 ; 
of an element of a group, 82; 
of a matrix, 75, 80; 
of a quadratic form, 160. 
Isobaric polynomial, 222; 
symmetric function, 245, 255, 256. 
Isomorphic groups, 83. 
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Jacobi, 144. 


Kronecker, 4, 139, 262, 314. 


A-equation of two conics, 164; 
of two quadratic forms, 166. 
A-matrix, 262. 
Lagrange’s reduction, 131. 
Laplace’s development, 26. 
Law of Inertia, 144; 
of Nullity, 78, 80. 
Line at infinity, 13. 
Line-coérdinates, 108, 110. 
Linear dependence, conditions for, 36-38; 
of geometric configurations, 39; 
of polynomials, 35, 38; 
of sets of constants, 35, 48. 
Linear equations, 43, 47, 49; 
transformations, 66. 
Linear factors of polynomials in one vari- 
able, 187; 
of binary forms, 188; 
of \-matrices, 270. 


Matrix, theory of, 20-22, 54-66, 74-80, 86, 93, 

262-283, 296-302 ; 

adjoint of, 77, 80; 

as a complex quantity, 60; 

augmented, 44; 

conjugate, 21; 

determinant of, 21; 

division of one by another, 75; 

elementary transformation of, 55; 

equivalent, 55; 

fractional, 86; 

inverse of, 75; 

multiplication by matrix, 63; 

multiplication by scalar, 62; 

normal form of a, 56 (Exercise 3) ; 

normal form of a symmetrical, 59; 

of a bilinear form, 114; 

of a determinant, 21; 

of a quadratic form, 128; 

of a quadric surface, 118; 

of a system of linear equations, 44; 

of a transformation, 66. 

orthogonal, 154, 302, 304; 

powers of, 75; 

product of two, 63; 

rank of, 22; 

rank of product of two, 77; 

scalar, 76; 

similar, 283; 

singular, 65; 

skew-symmetric, 59; 

sum or difference of two, 62; 
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Matrix — Continued. 
symmetric, 56; 
transposed, 21; 
unimodular, 83; 
unit, 74. 
Minors of a determinant, 22; 
complementary, 22; 
corresponding, 31; 
principal, 23, 57-59. 
Mixed concomitants, 109. 
Moore, 86. 
Multiplication theorem, 28. 
Multiplicity of roots of an equation, 18; 
of pieces of curves and surfaces, 211, 214. 


Neighborhood of a point, 8, 16, 214. 
Newton’s formule, 244. 
Normal form of a bilinear form, 116; 
of a binary biquadratic, 261; 
of a binary cubic, 239; 
of a \-matrix, 267; 
of a matrix, 56 (Exercise 3) ; 
of a pair of bilinear forms, 289; 
of a pair of quadratic forms, 169, 171, 306; 
of a quadratic form, 135; 
of a quadric surface, 124; 
of a real quadratic form, 148; 
of a symmetrical matrix, 59. 
Nullity, Sylvester’s Law of, 78, 80. — 
Null-system, 117. 


Order of a determinant or matrix, 20; 
of a group, 87. 
Orthogonal transformation, matrix, determi- 
nant, 154, 173, 302, 304. 


Pencil of conics, 163, 312; 
of bilinear forms, 279; 
of quadratic forms, 165, 310. 

Period of an element of a group, 87. 

Plane at infinity, 13; 
conjugate, 158. 

Plane-coordinates, 107. 

Point in space of n dimensions, 9; 
at infinity, 12; 
conjugate, 121; 
equation of a, 107, 108; 
neighborhood of a, 8, 16, 214. 

Polar plane, 122; 
form, 127; 
tetrahedron, 125. 

Pole, 124. 

Polynomial, definition, degree of, etc., 1, 4; 
continuity of a, 14; 
corresponding, 178; 
dyadic, 79; 
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in a matrix, 296; 

isobaric, 222; 

linear dependence of, 35; 

real, 5; 

roots of a, 18; 

symmetric, 240, 252. 
Prime (relatively) polynomials, 175. 
Principal minors, 23, 57-59. 
Product, degree of, 1, 4; 

of determinants, 26; 

of matrices, 63, 277. 
Projective transformation, 69; 

property, 88, 232. 
Pseudo-tangent lines and planes, 120. 


Quadratic forms, 127; 
adjoint of, 159; 
definite and indefinite, 150; 
invariants of, 129, 137, 146, 159, 165, 303; 
inverse or reciprocal of, 160; 
law of inertia of, 144; 
matrix, discriminant, rank of, 128; 
normal forms of, 135, 148, 169, 171, 306; 
pairs of, 165, 302; 
polar of, 127; 
real, 144; 
reducibility of, 136, 147; 
reduction of, to sum of squares, 131, 139, 
167, 170, 173; 
regularly arranged, 147; 
signature of, 146; 
singular, 128; 
vertex of, 129. 
Quadric surface, matrix, discriminant, rank 
of, 118; 
classification of, 123, 149, 173; 
ruling of, 119; 
singular, 118; 
tangent to, 119, 120, 155. 
Quantic, 4; 
Quaternary form, 5. 


Rank of a matrix or determinant, 22, 
54-59; 
of a bilinear form, 114; 
of a \-matrix, 262 ; 
of a quadratic form, 128; 
of a quadric surface, 118; 
of a system of homogeneous linear equa- 
tions, 47; 
of a system of points or linear forms, 
94; 
of the adjoint of a quadratic form, 161; 
of the product of two matrices, 77. 
Rational invariants, 96, 222; see also in- 
variant, integral, rational ; 
of a \-matrix, 270. 
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Rational relation, 244. 
Rationality, domain of, 175, 212, 216. 
Ray coordinates, 113. 
Real polynomials, 5, 174; 
matrix, \-matrix, elementary transforma- 
tion, 275; 
quadratic forms, 144-154, 161, 170-173. 
Reciprocal or inverse of a quadratic form, 
160. 
Reciprocation, 117. 
Reducibility of a polynomial, 174; 
in a domain, 174, 175; 
of bilinear forms, 116; 
of binary forms, 188; 
of determinants, 176; 
of polynomials in one variable, 187; © 
of quadratic forms, 156, 147. 
Regularly arranged quadratic form, 147. 
Resultant of linear forms, 95; 
of two binary forms, 201, 236, 239, 257; 
of two polynomials in one variable, 195, 
239, 248. 
Roots of a polynomial or equation, 18. 
Ruling of a quadric surface, 119. 


S-functions, 241, 253. 
=-functions, 240, 252. 
Sealar, 62; 
matrix, 76. 
Self-conjugate tetrahedron, 125; 
triangle, 164. 
Semi-definite quadratic form, 150. 
Set of objects, 80. 
Sen, 147. 
Signature of a quadratic form, 146. 
Similar matrices, 283. 
Simple elementary divisors, 270. 
Singular matrix, 65; 
bilinear form, 114; 
conic, 163, 272; 
linear transformation, 67; 
quadratic form, 128; 
quadric surface, 118. 
Skew-symmetric determinant, 59; 
bilinear form, 117; 
matrix, 59. 
Smith, H. J. S., 262. 
Subgroup, 83. 
Subresultant, 197. 
Sylvester, 78, 144, 199, 262. 
Symbolic product of bilinear forms, 114. 
Symmetric determinant and matrix, 56, 
299 ; 
bilinear form, 115; 
binary function, 255; 
polynomial, 240; 
polynomial in pairs of variables, 252; 
ternary function, 257. 
System, 80. 
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Tangent lines and planes to quadric surface, | Unimodular matrix, 83. 
true and pseudo, 119, 120. Unit matrix, 74. 

Ternary form, 5; 
symmetric function, 257. 

Transformation, aftine, 70; 

* determinant and matrix of, 66; 

elementary (of a matrix), 55, 262; 
identical, 67; 


Vertex of a cone, 120, 122, 123, 156; 
of a quadratic form, 129. 


inverse, 67; 
linear, 66; Weierstrass, 262, 270, 314. 
orthogonal, 154, 173, 304; Weight of an invariant, 96, 225; 
projective, 69; of a covariant, 97, 226; 
singular, 67. of a polynomial, 222; 
Trausposed matrix, 21. of a symmetric polynomial, 245, 253. 
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